Gene Cagg_1837 details

Gene Information

Locus tag: Cagg_1837
Symbol:
ID: 7267749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2253845
End bp: 2254927
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 56%
IMG OID: 643566673
Product: aminodeoxychorismate lyase
Protein accession: YP_002463168
Protein GI: 219848735
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
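
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1083 bp) and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2253845, 2254927)
print(length)                  # 1083
print(protein_length(length))  # 360
```

Both results match the Gene Length and Protein Length fields of this record.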


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCATG TCCGCGCCCT CTTACTCGGT CTTTCTCTCC TGGCACTGGT AGTCTCGTGT 
GCGGGTTATG TCTTCTTGAG TGAATTGCGG GCGACACCGG CGACTACCAA CACACCGGTT
GAGTTTATCG TCGCCCCCGG TGAAACAACC AACGATATTG CGAATCGGTT GGCCGAAGCC
GGTCTGATCC GCCAACCGGC CCTCTTTCGC GCACTGGTCC GCTGGCGCGG TCTCGATCAG
CAGATACAGG CCGGGCGCTA TGTACTCAGC CCGACGATGA CAATGAGCGA GATTCTGATC
GTCTTACAGA GCGGAAAGGT GGTCAACGAT ATTCAGATCA CGATCCCGGA AGGATTGCGT
CTCGAAGAGA TCGCTGCGAT TATCGCCGCT GCCGGCCTCG TGAGCGAAAA CGATTTTTTG
ACCGTTGCAC GTGACGGCGA CCGATTCCGT GCAGATTATT TTCTGCTTAA TAGCTTGCCG
GAAGGGGCGA CACTCGAAGG CTATCTCTTC CCCGATACCT ATCGGTTTGC ACCCTCGTCT
GATGCCGAAA CCATCGTGCG TAAGCTACTC GACCGCTTTG TTGAGCAGTA TAGTACGATT
GAGCGTTCGG TCCGGGTACC GGGTGTTACC GTCCATCAGA TCGTCACAAT GGCGTCGATT
GTCCAACGTG AGGCAGCTCT CCTCAGCGAG ATGCCACGTA TTAGCGCGGT CTTCTGGAAT
CGTCTCAAAC CGCAATATGC CCCCATCTTC GGCGGGGGGT TGCTCGGCGC CGATGCGACG
GTACAGTATG CGATTGGCTA TGATCCCGGT GAAGGTACGT GGTGGAAACG TAATCTGACC
GTTGACGATC TGGCGATTCA AAGCCCGTAC AATACGCGCA TCAATCCCGG TTTGCCACCA
GGCCCAATTG CTGCTCCCGG CCTTGCTGCG CTCACGGCTG CGGCTCAGCC CGATGAATCG
TCGCCCTATC TGTTTTTTGT CGCCAGTTGC GAGTTTGATG GTTCACATAA GTTTGCAACG
ACTATCGAAG AGTTTCGTGT CTATGAAGCG GAGTGGTTGG CGTGCCAGCA GAATCGACCC
TAA
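
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do this in Python, assuming the sequence is pasted as text in the 10-base blocks shown here; `gc_fraction` is a hypothetical helper name, and the example call uses only the first block of the gene rather than the full sequence:

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above
print(gc_fraction("ATGCGGCATG"))  # 0.6
```

Applying the same function to the whole gene sequence gives the gene-wide value to compare against the GC content field.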
 
Protein sequence
MRHVRALLLG LSLLALVVSC AGYVFLSELR ATPATTNTPV EFIVAPGETT NDIANRLAEA 
GLIRQPALFR ALVRWRGLDQ QIQAGRYVLS PTMTMSEILI VLQSGKVVND IQITIPEGLR
LEEIAAIIAA AGLVSENDFL TVARDGDRFR ADYFLLNSLP EGATLEGYLF PDTYRFAPSS
DAETIVRKLL DRFVEQYSTI ERSVRVPGVT VHQIVTMASI VQREAALLSE MPRISAVFWN
RLKPQYAPIF GGGLLGADAT VQYAIGYDPG EGTWWKRNLT VDDLAIQSPY NTRINPGLPP
GPIAAPGLAA LTAAAQPDES SPYLFFVASC EFDGSHKFAT TIEEFRVYEA EWLACQQNRP