Gene GM21_0751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0751 
Symbol 
ID8136066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp896675 
End bp897745 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content63% 
IMG OID644868368 
Productaminotransferase class V 
Protein accessionYP_003020583 
Protein GI253699394 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.000622725 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACAAGA AGCTGTTCAT CCCGGGACCG ATCGAGGTCA GCCCGGAGAT ACTCAAGGCC 
ATGGGTACGC CGATGATCGG ACACCGCATG CCCGAGTACG CGCAACTGCA CAAGGGCGTG
ACGGACAAGC TGAAGCGGTT GATGTTCACC AAGGAGCGGG TTTTCCTCTC CACCTCCAGC
GCCTTCGGCG CCATGGAGGG TGCGGTCAGA AACCTCGTGG GCAAGCGCTG CGCCAACTTC
TGCAACGGCG CCTTCTCCGA CAAGTGGCAC AACGTCACGC TGGCCTGCGG CAAGGAAGCC
GACCCCTTCA AGGTCGCCTG GGGCGAGCCC ATCACCCCTG AACTGGTCGA TTCGGCGCTT
GCCACCGGCA AGTACGACGC CATCACCCTG ATCCACAACG AGACCTCGAC CGGGGTCATG
TCCCCGCTTC CCGAGATCGC CCAGGTGCTC AAGAAGTACC CGGAGGTGGT CTCCATCATC
GACACCGTCT CCTCCATGAG CGCCCTGAGG CTCCCCGTGG ACGAACTGGG GATCGACTGC
TGCGTCTTCG GCGTGCAGAA GGCGTTCGCC CTCCCGCCGG GGCTCGCCGT CTTCACCGCC
AGCGAGAAGG CCTTGGAACG CGCCAAGGGG GTCCCCGGAC GCGGCTACTA CTTCGACTTC
CTCGAGTTCC TGGCGGCGGA CGAAAAGAAC AACACCCCGT CCACCCCTTG CATCTCGCTC
ATCTACGCGA TGGACCTGCA GTTGGAGCGT ATCTTCGCGG AGGGGCTGGA GAAGAGATGG
GAGCGGCACG CGAGGATGGC CGAGTTCATG CGCGCCTGGG TTAAGGAGCA CGGCTTCGGC
CTCTTCCCGT CGGAAGGGTA CCGCTCGGTC ACCCTTACCT GCGCCTCCAA CGACCGCGGC
GTCGACCTGG GCCTTATGAA GAAGCAGTTG GGCGAACGTG GCTTCGCCTT CGACGACGGC
TACGGCAAAA TCAAGGGGAA AACCTTCCGG GTGGCCCACA TGGGGGACAT GCAGCTGGAA
AACCTCAAGG AAATCACAAC CGAGATGGAG GGGATCCTGC AGGGTCTCTA G
 
Protein sequence
MHKKLFIPGP IEVSPEILKA MGTPMIGHRM PEYAQLHKGV TDKLKRLMFT KERVFLSTSS 
AFGAMEGAVR NLVGKRCANF CNGAFSDKWH NVTLACGKEA DPFKVAWGEP ITPELVDSAL
ATGKYDAITL IHNETSTGVM SPLPEIAQVL KKYPEVVSII DTVSSMSALR LPVDELGIDC
CVFGVQKAFA LPPGLAVFTA SEKALERAKG VPGRGYYFDF LEFLAADEKN NTPSTPCISL
IYAMDLQLER IFAEGLEKRW ERHARMAEFM RAWVKEHGFG LFPSEGYRSV TLTCASNDRG
VDLGLMKKQL GERGFAFDDG YGKIKGKTFR VAHMGDMQLE NLKEITTEME GILQGL