Gene GM21_1887 details


Gene Information

Locus tag: GM21_1887
Symbol:
ID: 8137221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2192964
End bp: 2194259
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 62%
IMG OID: 644869501
Product: adenylosuccinate lyase
Protein accession: YP_003021698
Protein GI: 253700509
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
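
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2192964
end_bp = 2194259
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop agree with Protein Length
```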


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATAGAAC GTTACAGCCG TCCTGAAATG GCCCGTATCT GGGAACCCGA AAACCGCTAC 
CGCAAGTGGC TCGAAATAGA AATATACGCC TGCGAGGCGC ACGCAGAGAT GGGGCGCATA
CCCAAGGACG CAGTGGCCCG CATCAAGGCG AAAGCCAACT TCGACGTCCC CCGCATCGAC
GAGATCGAGC GCACCGTCAA GCACGACGTC ATCGCCTTCC TCACCTCCGT CGCCGACTAC
ATCGGCGACG ACTCCCGTTT CGTCCACCTG GGCCTTACCT CCTCCGATGT CCTCGACACC
TCCTTCGCCA TGCTCCTCAA GGAAGCGGGG GAGCTGATCG TAGCCGACAT CAAGCGCCTG
ATGGCCGTCA TCAAGACCCG CGCCTACGAG CACAAGATGA CGCCGCAGAT GGGGCGCTCG
CACGGCATTC ACGCCGAGCC GGTCACCTTC GGCCTGAAGA TGGCGCTTTG GTACGACGAG
ATGGCCAGGA ACCTGAAGCG GATGGAAGCG GCCCTGGAGA CCATCGCCTA CGGCAAGCTC
TCCGGCGCGG TCGGTACCTT CGCCAACATC GACCCGCAGG TCGAGGCTTT CGTCTGCAAG
AAGGCGGGGT TGAAACCCGC CCCCTGCTCC ACGCAGGTGC TGCAGCGCGA CCGCCACGCC
GAATACTTCA CCACCCTGGC GATCATCGCC TCCTCCATCG AGAAGTTCGC CGTCGAGATC
AGGCACCTGC AGCGCACCGA GGTCCTCGAG GCCGAGGAGT TCTTCAGCAA GGGGCAGAAA
GGCTCCTCCG CGATGCCGCA CAAGCGCAAC CCGGTCCTCT CCGAGAACCT GACCGGCCTG
GCCCGCCTGA TCCGAGGCTA TGCGGTCTCC GCCATGGAGA ACGTGCCGCT GTGGCACGAG
CGTGACATCT CGCACTCCTC CGTGGAGCGC ATCATCGGTC CGGACGCAAC CGTGATGCTC
GACTTCATGC TGAACCGCGC CATCGGGCTG ATCGAGAACC TGGTGGTCTA CCCCGAGAAC
ATGATGCGCA ACCTGAACCA GATGCGCGGT CTCATCTTCT CGCAGCGCGT GCTCCTGAAA
CTCGCCGAGG CGGGTGCTTC CCGTGAGAAG GCCTACTCGC TGGTACAAAG AAACGCCATG
AAGGTCTGGG AAGAGGGGAA AGACTTCCAG ACCGAGCTTC TGAACGACGC CGAAGTCGCC
GGCTTCCTCC CCGCCGAGGA GATCAAGGAA GCGTTCGATC TCGGTTACCA TCTGAAACAC
GTCGACACTA TTTTCACGAG GGTCTTCGGT GGATAG
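
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes GC content is simply the fraction of G and C bases over all bases, ignoring the spaces and line breaks used for layout; `gc_fraction` is a hypothetical helper name, and only the first row of the sequence is used as sample input, so the printed value is not the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence above; paste all rows to check the whole gene.
first_row = "GTGATAGAAC GTTACAGCCG TCCTGAAATG GCCCGTATCT GGGAACCCGA AAACCGCTAC"
print(f"{gc_fraction(first_row):.0%}")
```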
 
Protein sequence
MIERYSRPEM ARIWEPENRY RKWLEIEIYA CEAHAEMGRI PKDAVARIKA KANFDVPRID 
EIERTVKHDV IAFLTSVADY IGDDSRFVHL GLTSSDVLDT SFAMLLKEAG ELIVADIKRL
MAVIKTRAYE HKMTPQMGRS HGIHAEPVTF GLKMALWYDE MARNLKRMEA ALETIAYGKL
SGAVGTFANI DPQVEAFVCK KAGLKPAPCS TQVLQRDRHA EYFTTLAIIA SSIEKFAVEI
RHLQRTEVLE AEEFFSKGQK GSSAMPHKRN PVLSENLTGL ARLIRGYAVS AMENVPLWHE
RDISHSSVER IIGPDATVML DFMLNRAIGL IENLVVYPEN MMRNLNQMRG LIFSQRVLLK
LAEAGASREK AYSLVQRNAM KVWEEGKDFQ TELLNDAEVA GFLPAEEIKE AFDLGYHLKH
VDTIFTRVFG G
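
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a rough sketch, assuming NCBI translation table 11 (standard amino-acid assignments, with TTG, CTG, ATT, ATC, ATA, ATG and GTG accepted as start codons that are read as M in first position). The `translate` function and constant names are illustrative, not taken from the source page.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G); table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11; an initiating codon is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence; compare with the start of the protein sequence.
print(translate("GTGATAGAAC GTTACAGCCG TCCTGAAATG"))  # MIERYSRPEM
```

Reading the leading GTG as M rather than V is what makes the output agree with the first residue of the listed protein.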