Gene Cagg_1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1017 
Symbol 
ID7268389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1256291 
End bp1257352 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content57% 
IMG OID643565863 
ProductHomoserine dehydrogenase 
Protein accessionYP_002462368 
Protein GI219847935 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.698035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCCA TTATCCAACT TGGCATCGGT GGCGTCGGAC GAGCTTTAGC ACGACAAATA 
GTAGCCGTCG CGCCTGCCAT TCGCCGACGC TATGGCATAG ATCTACGCTA CATAGCGATT
GCTGATAGTC GCGGTATCAT TGCCGGTGAT CCGACGGTGA GTGAAGAACA AGTACATCAA
ATCCTCGCCG TAAAAGAAGC CGGTCATGGG CTTGATAGTA TGACTAATGC GATCACCGAT
CGGCACTGGA TAGAGTTACT CCCTGCGACA ATAGCAATCG TTGTTGATGT CACGGCAACG
AGTGAACATA CCGCGCCATT GGCCGCAGCC GTCTCGGCAG GTCATCGCGT TGTATTGGCC
AATAAACGCC CGCTATGTGA TGAGTACGAT CTGTTTACCG CGCTCACCGA ACGTGGTGCA
ACCCGCTACG AAGCGACGGT TGGGGCCGGT TTGCCGGTTA TTGGGGTATT ACAGGGCTTG
CTCGACACCG GTGATGAAGT ACTGCGGATC GAAGCGGCCT TGAGTGGTAC GCTCGGCTTT
CTGATGAGCG CGTTAGAAGA GGGTAGCAGT TTTGCCGAAG CGGTATGGAA AGCGCACGCA
CTCGGCTACA CCGAGCCGGA TCCGCGAGAT GACCTGAGCG GAGCTGATGT GGCGCGTAAG
GCGCTGATTT TGGCCCGTAC CTGTGGTATC CCCGTACCGG CTGACGCGGT GAGTGCTGAA
TCGCTCTTCC CACCCCAGCT CGCAACGGTC AGTGTGGCAG AGTTCTTGCA ACGCTTGCCC
GAAGCCGAGG AATCTGTTAT GGAACGCTTT GCCGCGGCCC GTGCCGCCGG CAACGTCTTG
CGGTATATCA CATGCATCAC GCCGGACAAC ATCGAGGTGG GGTTGCGCGA GTTGCCCGCC
GATCATCCGC TCGCCGGTCT GCGTGGCCCC GACAATATGA TCAGCTTCAC CACCCGACGT
TACCACGACC GACCAATGGT GATCCGTGGG CCAGGTGCAG GGGTTGAAGT GACGGCAACC
GGTGTGTTGA GCGATATTAT TGCGACAGCA CGAGAACTGT GA
 
Protein sequence
MTPIIQLGIG GVGRALARQI VAVAPAIRRR YGIDLRYIAI ADSRGIIAGD PTVSEEQVHQ 
ILAVKEAGHG LDSMTNAITD RHWIELLPAT IAIVVDVTAT SEHTAPLAAA VSAGHRVVLA
NKRPLCDEYD LFTALTERGA TRYEATVGAG LPVIGVLQGL LDTGDEVLRI EAALSGTLGF
LMSALEEGSS FAEAVWKAHA LGYTEPDPRD DLSGADVARK ALILARTCGI PVPADAVSAE
SLFPPQLATV SVAEFLQRLP EAEESVMERF AAARAAGNVL RYITCITPDN IEVGLRELPA
DHPLAGLRGP DNMISFTTRR YHDRPMVIRG PGAGVEVTAT GVLSDIIATA REL