Gene GM21_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4011 
Symbol 
ID8139385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4596134 
End bp4597417 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content64% 
IMG OID644871627 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_003023785 
Protein GI253702596 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.49886e-31 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCAAAACA GCCGCTCGAC CAAACTCTTT CAGCAGGCGC TTCAGTCCAT CCCCGGCGGC 
GTCAACAGCC CGGTGCGCGC CTTCAGGTCC GTTGGCTCCG ATCCGCTCTT CATCAAAAAG
GCGTTAGGCC CCCGCATCTA TGATGAAGAT GGCAACACCT TCATCGACTA CGTCGGCTCC
TGGGGTCCGA TGATCCTTGG GCACTGCCAC CCGCAGGTCG TGGCCGCCAT CAAGGCCGCC
GTCGACAACG GCGCCAGCTT CGGCGCGCCC ACCGAGCTAG AGATCACGCT GGCCGAGATG
GTGATCGATG CGGTCCCCTC CATCGAGATG GTGCGCATGG TGAGTTCCGG TACCGAGGCG
ACCATGAGCG CCATCAGGCT CGCCCGCGGC TACACCGGCC GCGACAACAT CCTGAAGTTC
TCCGGTTGCT ACCACGGCCA CTCCGACGCG CTTTTGGTCA AAGCCGGATC CGGCGCCGCC
ACCTTCGGCG TCCCCGACTC CCCCGGCGTC CCCGCCGACC TCGCCAAGCA CACGCTGACC
GCGACCTACA ACGACCTCGA CTCGGTCCGG GCGCTGGTGG CGGCCAACAA GGGGAGCATC
GCCTGCATCA TCGTGGAGCC TGTGGCTGGC AACATGGGAA CAGTCCCCCC CAAGGAAGGA
TTCCTGGAAG GGCTTAGGAG CATCTGCAGC GAGGAAGGGA TCGTGCTGAT CTTCGACGAG
GTGATGTCCG GCTTCAGGGT TGCCTACGGC GGCGTTCAGG AACTCTACGG CGTGACCCCC
GACATGACCA CGCTGGGCAA GATCATCGGC GGCGGTCTGC CGGTGGGGGC GTTCGGCGGG
AAAAAAGAAA TCATGTCCCT TCTTTCACCG GCGGGGGGAG TGTATCAGGC CGGGACCCTC
TCTGGCAACC CCCTGGCCAT GACCGCCGGG ATCGAGACCT TGAAGCTCCT CAAGGAGCCT
GGGTTCTATC AGAAGCTGGA AGAAAAGAGC GCCTTCGTGG CGGAGGGGAT CGCAAAGGCC
GCCAGGGACG CCGGTTTCCC GATCTACTCC ACGCGGGTAG GCTCCATGTT CTGCGCCTTT
TTCTCCAAGG ATCCCGTCTA CGACTGGGAC AGCGCCGCCA AGTGCGACAC CAAGGCCTTC
GCCGCCTACT TCAAGGCGAT GCTGAACGAA GGGATCTACC TGGCGCCTTC GCAGTTCGAG
ACCGCGTTCG TCGGCATCTC CCACAGCACG GAGGACCTGG AGCAGACCAT CGCGGCGGCC
GCCAAGTGCT TTAAGGCGCT GTAG
 
Protein sequence
MQNSRSTKLF QQALQSIPGG VNSPVRAFRS VGSDPLFIKK ALGPRIYDED GNTFIDYVGS 
WGPMILGHCH PQVVAAIKAA VDNGASFGAP TELEITLAEM VIDAVPSIEM VRMVSSGTEA
TMSAIRLARG YTGRDNILKF SGCYHGHSDA LLVKAGSGAA TFGVPDSPGV PADLAKHTLT
ATYNDLDSVR ALVAANKGSI ACIIVEPVAG NMGTVPPKEG FLEGLRSICS EEGIVLIFDE
VMSGFRVAYG GVQELYGVTP DMTTLGKIIG GGLPVGAFGG KKEIMSLLSP AGGVYQAGTL
SGNPLAMTAG IETLKLLKEP GFYQKLEEKS AFVAEGIAKA ARDAGFPIYS TRVGSMFCAF
FSKDPVYDWD SAAKCDTKAF AAYFKAMLNE GIYLAPSQFE TAFVGISHST EDLEQTIAAA
AKCFKAL