Gene GM21_0068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0068 
Symbol 
ID8135367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp85955 
End bp87124 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content67% 
IMG OID644867685 
Producthypothetical protein 
Protein accessionYP_003019913 
Protein GI253698724 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.19832e-26 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGTATC CCGTTTCGCC GCTGATCTCA GGCGTGCACC CGCCTCCCAT CTCCGAGGTG 
AAGGGTTGGC TCGCCGGGGC CCCGGCCGGG GTTCCCCTCA TCGACCTCTG CCAGGCCATA
CCCGATTACC CCCCGCCGCG GGAGCTCACC GACCACCTGG CCCAGGTGAT GCTCGACCCG
CACACCTCGC GCTACAGCAT CGACGAAGGG CTTCCCGAGG TGCGGGAGGC GGTCTGCGCC
GGGTACCGCG AGCTCTACGG CGCCTGCATC GATCCTGCCC AGCTCATCCT CACCATCGGC
GCCAGCCAGG CTTTCTGGCT CGCCATGGTG ACGCTTTGCC GCGCCGGAGA CGAGGTCATC
GTCCAGCTGC CGGCCTACTT CGACCACCCG ATGGCGCTCG CGGTGCTCGG CATCCGCTGC
GTCTACGCCC CGTTCGAGGA GGAAAGCTGC GGGCTCCCCA GTGTCGCCGC CATAGCCCCC
TTGATCACGG AGAAGACCCG CGCAATTCTG CTGGTCACCC CCAGCAACCC CACCGGCGCC
GTGATACCGC CCGAGACCGT GCGCGAGTTG CACCGCCTCG CCGTCTCCCG CGACATCGCC
TTGGTGCTGG ACGAGACCTA CAACAGCTTC ATCACGGGGG GCGCCCGCCC CCACGACCTG
TTCCAGAAGC CGAATTGGGG GGACCATTTC GTCCACATCG CCTCCTTCGG CAAGACCTTC
GCGCTCACCG GCTACCGCGC CGGGATGCTG GCCGCGTCGG AGGAATTCAT CCGCCACGCG
CTGAAGGCGC AGGACACCAT GGCGGTATGC CAGCCGCGCG TCACACAGCA CGCGGTGAAG
TACGGCTTCG AGCAGCTGGG GGGATGGGTC GCCGCGAACC GGGTCATGAT GGAGAGAAGG
CACGAGGTGT TCCGCGCCGA GTTCGAGAAG CCCGGCAACT CCTTCAAGCT GGTGGCGAGC
GGCCCCTTCT TCGGCTGGGT GCGGCATCCG CTGCGAAATG CCGCGGGGAG GGAGGTCGCC
AGGCGCCTGG TGGAAGAGGC GGGGGTGCTG CTGCTGCCGG GGGAGGTGTT CGGACCGGGG
TTGGAGGGGT ACTTAAGGCT CGCCTTCGGC AACATCAGGG AAGAGACCAT ACCCGAGGCG
GTGAAACGGT TCAGGGAATT CAAAACCTAA
 
Protein sequence
MRYPVSPLIS GVHPPPISEV KGWLAGAPAG VPLIDLCQAI PDYPPPRELT DHLAQVMLDP 
HTSRYSIDEG LPEVREAVCA GYRELYGACI DPAQLILTIG ASQAFWLAMV TLCRAGDEVI
VQLPAYFDHP MALAVLGIRC VYAPFEEESC GLPSVAAIAP LITEKTRAIL LVTPSNPTGA
VIPPETVREL HRLAVSRDIA LVLDETYNSF ITGGARPHDL FQKPNWGDHF VHIASFGKTF
ALTGYRAGML AASEEFIRHA LKAQDTMAVC QPRVTQHAVK YGFEQLGGWV AANRVMMERR
HEVFRAEFEK PGNSFKLVAS GPFFGWVRHP LRNAAGREVA RRLVEEAGVL LLPGEVFGPG
LEGYLRLAFG NIREETIPEA VKRFREFKT