Gene Cagg_1919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1919 
Symbol 
ID7266410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2352090 
End bp2353319 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content56% 
IMG OID643566756 
ProductCBS domain containing protein 
Protein accessionYP_002463250 
Protein GI219848817 
COG category[S] Function unknown 
COG ID[COG1993] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0042523 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGAAC CAACGGTGCA ACGGGTACGC ATCTATTTAA ATGGTGAAGA TAGCGCAGAC 
GGTCAACCGC TCTATAAAGT GGTAGTAGAC GAATTGCGTC AGAGTGGTGC GACCGGTGTG
ACGGTTCTGC AGGCTTTGAC CGGCTTCGGA CCACGTCGGC AGATGTTACC CAACGCGATG
CGGCAGCCGG TCGTAATCGA GTGGGTTGAC AATGCCGTTC GTATCCAACG ATTACTGCCG
TTGCTGAACC GCTTGATCGG TGATGCGCTG GTAACGATCG AGCCGGTAGC CATTGTGCAG
GGTGTACTGC GTCCGGCCGG TCCGTTCAGT GCCGCCCAAT TAGTCTCCGA TCTGATGCAA
GCAGACGCAC CGGTGATTGA CGCCACTGCA CCACTCCTCG ATGTCCTCGA ACCGTTTATC
ACCGGTCGGG TGGAAGTATT GGCAGTAGTT GAGAATGATA CCGTCATCGG CACTATCTCG
CTGCGTGAAT TAGTGTGGCG TGCCGGTTTA CGGGTACCAC CTTACCTGCT GAGTATGCTT
GAGCCGGCCG AACGAGCGGC AGTGCTGGCA CCGCTACAGG CATTAACTGC CGGTGCGATT
ACTAATCGTG AGATACGTGG TGTGCATACA ACGATGCCGA TTACGCAAGC CCTGACCCGT
ATGATCGAGT GGGGCTACAA CCAAATACCG GTGCTCGATC CGCTCGGGCG GTTGGCCGGG
GTGTTCGGGC AACACGAGGT GTTGCAGGCA GTAGCGCACC GGTCTGAATC GGAGGAAGCT
ACCGGTTTGG AACTCCAGGT GGGAATGGTG ATGCAAGCGG CAACAGCACG GGCTACGCTT
GGTCAATCAT TGGCGACAGC ACTTGCCTTG CTGATCACAA CTCCGGGTCA AAGTTTGTTT
GTCGTTGATG GCGATAGGCG TGTTGTGGGT GTCTTGCGGT TAAGTAAGGT ACTATCCAAT
TTGCAAGACG ACGAGCGAAC GAGCTTACTG ACTGCATTGC AAAGCACTCA ACGAGTCCAG
CCGACAGCGT TGCCTGGGGC ACGTCGCACG ATCGACGCCT TTCTCGAACC ACCACCACCG
GTATTGGCGA TCAATACCTC GCTCGGAGTT GCCGCTCGTC AGTTACTCAC AATGAACACG
GAGCGGTTGC CGGTGGTAGA CTCAGAAGGG CGACTGTCGG GTATAATCGC ACGTGGTGCC
CTCGTGCGGG CATTACTACA ACACGAATAG
 
Protein sequence
MAEPTVQRVR IYLNGEDSAD GQPLYKVVVD ELRQSGATGV TVLQALTGFG PRRQMLPNAM 
RQPVVIEWVD NAVRIQRLLP LLNRLIGDAL VTIEPVAIVQ GVLRPAGPFS AAQLVSDLMQ
ADAPVIDATA PLLDVLEPFI TGRVEVLAVV ENDTVIGTIS LRELVWRAGL RVPPYLLSML
EPAERAAVLA PLQALTAGAI TNREIRGVHT TMPITQALTR MIEWGYNQIP VLDPLGRLAG
VFGQHEVLQA VAHRSESEEA TGLELQVGMV MQAATARATL GQSLATALAL LITTPGQSLF
VVDGDRRVVG VLRLSKVLSN LQDDERTSLL TALQSTQRVQ PTALPGARRT IDAFLEPPPP
VLAINTSLGV AARQLLTMNT ERLPVVDSEG RLSGIIARGA LVRALLQHE