Gene GM21_3019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3019 
Symbol 
ID8138365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3506887 
End bp3508104 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content58% 
IMG OID644870620 
Productaspartate kinase 
Protein accessionYP_003022806 
Protein GI253701617 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones111 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGG TGGTCCAAAA ATACGGCGGA ACCTCGATGG GTTCTGTCGA ACGGATTCGT 
AACGTCGCTA AGAGGGTGGC GAAAACCTAC GATGCCGGCA ACGACATGGT GGTGGTAGTC
TCCGCCATGT CCGGCGAAAC TAACAAGCTG GTGGCCCTTG CCAACGAAGT CTGCGAATTC
CCGGACAACC GTGAATACGA CGTGCTGGTT GCAGCAGGCG AGCAGGTTTC CATCGCGTTG
CTGGCCATGT GCCTTAAATC CATGGGGTAC AAAGCGAAAT CCTACTTGGG CTTCCAGGTT
CCCATTCTGA CCGATAGCGC CTACGCAAAG GCGCGCATCG AGAAGATCGA CGACGCCAAG
ATGCGCACCG ATTTGAAGGA AGGGACCATC CTCATCGTCG CGGGTTTCCA GGGTGTCGAC
CCGTCCGGCA GCGTCACAAC GCTTGGGCGC GGCGGCTCGG ACACCTCGGC GGTCGCTCTC
GCGGCGGCTC TGAAAGCGGA CGTTTGCGAA ATATTCACCG ACGTTGACGG GGTCTACACC
ACCGATCCCA ACATCTGCAA GGACGCGAAG AAGATCGAGC GCATCTCCTA CGAGGAGATG
CTGGAGCTGG CCAGCCTGGG CGCCAAGGTG CTCCAGATCC GCTCCGTCGA ATTCGCCAGC
AAGTACAACG TCGACGTCCA TGTCCGCTCA AGCTTTAACG AAAATCTCGG AACCATGGTT
ACCAAGGAGG ATAAAGATAT GGAAGCAGTA CTCGTCTCGG GTATCGCCTA TGCCAAGGAT
GAAGTGAAAA TAGCTGTGAT GCAGGTTCCG GACAAGCCGG GGATCGCCGC CCAGATCCTG
TCGCCGCTCT CCGATGCCAA TATCTCCGTG GACATGATCG TTCAGAACGT GAGCGAGGCC
GGTTCCACCG ACTTCACCTT CACCGTGCCC CAGGCCGAAT TCAAGAAGGC GCTGGCCATA
ACCCAGGAGA CCGCCCAGGC GATCAACGCC AAGGAAGTCC TCTCCGACGA GAACGTGAGC
AAGGTCTCCA TCGTTGGCCT CGGCATGAGG AGCCACGCAG GGGTCGCCAC CACCATGTTC
AAGGCGCTCG CCGCGGAAGG GATCAACATC CAGATGATCT CCACCTCCGA GATCAAGATC
TCCGTCGTCG TCGACGCGAA GTACACCGAG CTCGCCGTAA GGGTGCTGCA CGACGTCTTC
GGCCTGTCGG GGAAATAA
 
Protein sequence
MALVVQKYGG TSMGSVERIR NVAKRVAKTY DAGNDMVVVV SAMSGETNKL VALANEVCEF 
PDNREYDVLV AAGEQVSIAL LAMCLKSMGY KAKSYLGFQV PILTDSAYAK ARIEKIDDAK
MRTDLKEGTI LIVAGFQGVD PSGSVTTLGR GGSDTSAVAL AAALKADVCE IFTDVDGVYT
TDPNICKDAK KIERISYEEM LELASLGAKV LQIRSVEFAS KYNVDVHVRS SFNENLGTMV
TKEDKDMEAV LVSGIAYAKD EVKIAVMQVP DKPGIAAQIL SPLSDANISV DMIVQNVSEA
GSTDFTFTVP QAEFKKALAI TQETAQAINA KEVLSDENVS KVSIVGLGMR SHAGVATTMF
KALAAEGINI QMISTSEIKI SVVVDAKYTE LAVRVLHDVF GLSGK