Gene Athe_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2034 
Symbol 
ID7408247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2147577 
End bp2149166 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content40% 
IMG OID643716401 
Productferredoxin-dependent glutamate synthase 
Protein accessionYP_002573884 
Protein GI222530002 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTAA GAAGGCCAAA TGCAAATGAA GCAACAGGTA CGTTTAACCG CTCAAAAGAT 
GTTGTGCCAA TGTCAGGAAT TTGTACAAGA TGCGTGGATG GGTGTCAGGG AAACTGTGAA
ATCTTTTTGG CCTCTTTTAG AGGAAGAGAG GTTTTATACC CGGGACCTTT TGGAGAAGTG
ACTGCTGGTG CAGACAAAAA TTATCCTGTT GATTATTCGC ATCTCAATAT TCAGGGATAT
GCTGTTGGAG CAAAAGGGCT TCCTGAAGGA GTTGAACCTA ATCCAGATAC AGCTATTTTC
CCTAATGTTG ATACGACAAC TGAATATGGT TGGGAAAAGA AAGTTAAAAT GAAGGTTCCT
ATATTCACTG GAGCTCTTGG TTCTACAGAA ATTGCACGCA AGAATTGGGA ACATTTTGCT
GTAGGTGCTG CAATTTCAGG TATCACACTT GTTTGCGGAG AAAATGTTTG TGGAGTTGAC
CCTGAACTTG AACTCACATC AGATGGAAAA GTTAAAAAGT CCCCTGAAAT GGATAGAAGG
ATAAATACTT ATAAAAGATT TCATGAGGGC TGGGGCGAAA TATTAGTTCA AATGAACGTT
GAAGATACTC GCCTTGGTGT TGCAGAATAT GTTATTGAAA AACACGGGCT TGATACAATT
GAGCTCAAGT GGGGTCAGGG TGCAAAGTGT ATAGGCGGTG AGATAAAGGT AAAGAGCTTA
GAAAGAGCTT TAGAACTCAA AAAGAGAGGA TACGTTGTTT TGCCCGACCC AACTCAAAAA
GATGTTCAGG AAGCATTTAA AAGAGGTGCT ATAAGAGAAT TTGAAAGACA TTCCAGACTT
GGATTTGTTG AAAAAGAAAG CTTTTTGAAA GAGATTGAAC GGCTCAGAAG ACTTGGATTT
AAGAGAATAA CACTCAAAAC AGGAGCATAT TCAGCTGTTG AACTTGCAAT GGCACTAAGG
TTTGGTGCTG AGGCAAAACT TGATTTAATA ACAATTGATG GTGCACCAGG TGGCACAGGC
ATGAGCCCGT GGCCAATGAT GAATGAATGG GGTATCCCAA CATTCTATTT GGAAGCTCTG
GCATATCAGT TTGCTGAAAA GCTTGCAAAG AAAGGTTTTA GGGTTCCAGA CCTTGCAATC
GCAGGTGGTT TTTCAACAGA AGATGGTGTG TTCAAAGCAA TTGCAATGGG TGCACCATAT
GTAAAAGCTG TTTGTATGGG AAGAGCTTTG ATGATACCAG GTATGGTGGG CAAGAATATT
GAAAAGTGGT TAAAAGAAGG TAATCTACCA AAAACAGTAT CCAAGTATGG TTCAACACCC
GAAGAAATAT TCATAACATA TGAAGAGCTG CGTGAAAAGT ATGGTGATGA GATAAAGAAT
ATACCTCTTG GTGCAATTGG TATCTATACA TTTGTTCAAA AGTTCAAAAC AGGTTTGCAG
CAACTTATGG CAGGGTCAAG AAACTTCAGA ATTTCTACAA TCTCAAGAAA AGACCTCATC
GCTCTTACCG AAGATGCTGC TAAAATATCA GGCATTCCGT ATGTGATGGA TGCATACAGG
GAAGAGGCAG AAAGGATACT TGAAGAATAA
 
Protein sequence
MNLRRPNANE ATGTFNRSKD VVPMSGICTR CVDGCQGNCE IFLASFRGRE VLYPGPFGEV 
TAGADKNYPV DYSHLNIQGY AVGAKGLPEG VEPNPDTAIF PNVDTTTEYG WEKKVKMKVP
IFTGALGSTE IARKNWEHFA VGAAISGITL VCGENVCGVD PELELTSDGK VKKSPEMDRR
INTYKRFHEG WGEILVQMNV EDTRLGVAEY VIEKHGLDTI ELKWGQGAKC IGGEIKVKSL
ERALELKKRG YVVLPDPTQK DVQEAFKRGA IREFERHSRL GFVEKESFLK EIERLRRLGF
KRITLKTGAY SAVELAMALR FGAEAKLDLI TIDGAPGGTG MSPWPMMNEW GIPTFYLEAL
AYQFAEKLAK KGFRVPDLAI AGGFSTEDGV FKAIAMGAPY VKAVCMGRAL MIPGMVGKNI
EKWLKEGNLP KTVSKYGSTP EEIFITYEEL REKYGDEIKN IPLGAIGIYT FVQKFKTGLQ
QLMAGSRNFR ISTISRKDLI ALTEDAAKIS GIPYVMDAYR EEAERILEE