Gene GSU0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0337 
SymbolhemL 
ID2687321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp366273 
End bp367556 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content57% 
IMG OID637125003 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionNP_951397 
Protein GI39995446 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00652219 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCACAG CTCGCTCGAA GGATCTTTTC ACGCAAGCCC AGGAATTCAT TCCCGGTGGA 
GTAAACAGTC CCGTACGCGC CTTCAAGTCA GTGGGCGCCG ACCCTCTTTT CATTAAAAAA
GCGTTTGGTT GCACAATTAC TGACGCCGAC AACAACAGTT ATATCGATTA CGTCGGCTCC
TGGGGCCCCA TGATTCTCGG CCACTGCCAC CCCCAGGTTG TCGAAGCCGT AAAGCGGGCC
GTCGAAAGTG GCAGCAGCTT TGGCGCCCCC ACGGAGCTGG AAATCACCCT TGCCCGAATG
GTCATCGACG CGGTGCCCTC AATCGAGATG GTTCGTATGG TCAGCTCCGG AACAGAGGCG
ACCATGAGTG CCATTCGGCT TGCCCGCGGC TACACCGGCC GGGATAAAAT CATCAAATTC
TCAGGCTGCT ATCATGGCCA TGCCGACGCA CTCCTGGTGA AGGCTGGCTC AGGGGCTGCC
ACGTTTGGGG TGCCCGATTC GCCGGGAGTT CCCGTCGATG TGGCAAAAAA CACGCTCACT
GCTCAGTTCA ACGATCTTGA TTCGGTTTCG AAGCTTATTG ATGAGAACAA GAATGAGATC
GCGTGCATCA TTGTCGAGCC TATTGCCGGT AATATGGGCA CCGTTCCGCC GGGTGAAGGT
TTTCTCGAAG GGCTCCGCTC CATCTGCGAC AGCGAGGGAA TTGTCCTCAT TTTCGACGAG
GTAATGACCG GCTTCCGTGT TGCCTATGGT GGAGCTCAGG AACTTTACGG TGTAACTCCC
GACATGACCA CCCTCGGCAA GATCATCGGC GGTGGGCTTC CCGTCGGCGC CTTCGGCGGC
AAAAAGGACA TCATGAAGCT TCTCTCCCCG TCCGGAGGCG TTTATCAGGC GGGCACGCTG
TCGGGCAACC CCCTCGCCAT GACAGCTGGT ATCGAGACCC TCAAACTGCT TCAGGCAGAC
GGGTTCTATG AGCAACTGGA GCAAACGAGC CGCCGCCTTG CCGAGGGCAT CACCGAGGCG
GCCAAATCCG CCGGGTACCC CATCTATCCG ACCCGCGTCG GCAGCATGTT CTGCACCTTT
TTCACCAGCA ACGAGGTCAA GGACTGGCCC ACGGCCACAA CCTGCGACAC AAAGGCATTC
GCCGCTTTTT TCAGAATGAT GCTCGAGAAG GGGATCTATC TGGCTCCTTC GCAGTTTGAA
ACGGCCTTCG TCTCGATCGC CCACACCGAG GTGGAAATCG AGAAGACGAT TGTTGCGGCC
CGCTCATGCT TTGCCGCTCT CTAA
 
Protein sequence
MLTARSKDLF TQAQEFIPGG VNSPVRAFKS VGADPLFIKK AFGCTITDAD NNSYIDYVGS 
WGPMILGHCH PQVVEAVKRA VESGSSFGAP TELEITLARM VIDAVPSIEM VRMVSSGTEA
TMSAIRLARG YTGRDKIIKF SGCYHGHADA LLVKAGSGAA TFGVPDSPGV PVDVAKNTLT
AQFNDLDSVS KLIDENKNEI ACIIVEPIAG NMGTVPPGEG FLEGLRSICD SEGIVLIFDE
VMTGFRVAYG GAQELYGVTP DMTTLGKIIG GGLPVGAFGG KKDIMKLLSP SGGVYQAGTL
SGNPLAMTAG IETLKLLQAD GFYEQLEQTS RRLAEGITEA AKSAGYPIYP TRVGSMFCTF
FTSNEVKDWP TATTCDTKAF AAFFRMMLEK GIYLAPSQFE TAFVSIAHTE VEIEKTIVAA
RSCFAAL