Gene Cagg_1992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1992 
Symbol 
ID7268908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2433943 
End bp2435538 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content45% 
IMG OID643566823 
Producthypothetical protein 
Protein accessionYP_002463316 
Protein GI219848883 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00260165 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCACACCT CTCAATCAAT GCGAGGCAAA AAGCTCTTTA TTGACGCAGC AATTATTTGC 
TTCTTAATAA TGGCTTGGTT CAAATTATGC ATGGGAATAG AATCAAGTTT GGAAGTAGAC
ACTGCTGCCG ATGAGACACT CTACCTCTAT TCAGGAATCA CGCAAACCAC CCCCGCCGAC
TACGGCCCGC TGTATGGCCT ATGGTATTGG CTGCTCTGGT GGACTACGCC CGATCGCATC
GACCTCTACT ATTTGAACTG GCGCATGACC ACTCTCTTAC CGGTTCTCGC ATTTTACTTG
ATCTGCCGAC TTAATCGAGT TACACCACTG GTTAGTGCTG TCGCAGCATG GTTATTACTC
ATCTCGTCGA TCAACGTATC GACATGGCCG CGAGTATCAC ACCTTGCGCT CTTCATAGTG
TTGCTAAGTC TCTCCGCGAT TAGTCTGTTA CGATCACGGA GTCGGAGCAG CTTGGCTATC
GCTACCGGCT TGCTGGCAGC TAGCTATGTT CGACCGGAAT TGTTTCTTTC CTACGTTGCT
GGATTAGGTG TCGTTCTGAT TGATCTCATC CGTGATTATC GCCAAAAACA GCTATTGCCA
TGGCTAACAA TGATCATAAC TGGCTTCGTG CAACTAGCAC TCCTGATTTG GCAAGGGGTT
CCAATGACAG GCGAGCGTAG TTTCGTTGCA TTCTCTCAAC ACTTTGCAAC AAGCTGGATA
ACATGGAACA ACAGTCAGCT CGATCCTTGG AACGATTTCC CATATATTAT GCAGACTGCT
TTTGGCGACG ACGTAGATAC CGTTTGGGAA GCTTTTTTGG CCAACCCGAT GTTAATTCTA
CGTCATATGG TACAGAATGT CATCCGATTG TACGGTATTG CAACGCTATT ACCTGTAGGT
GTCGTTCACT ACTCCGCCAC CGCTGATCGA ATAGACAAAC TCTTCTTGTG GGTAGTTCTC
TTCACAGCAC TGGGAGTATT TCTGATGACC CTTCATCTCA TTCGTCGCTC AATTAGCGAA
CTACTTTTTC GCAAAGAGCA CAACCGCTTG CGAGTATGGA TAATTATTTG TACATTACTA
GTCTTTATAT CGATAATATC CATCTATCCA AGGCCTCACT ATTTACTTTT ACTTATACTT
CCTCTGCTTT TCTTTGTTAT AGTTGTGTAT ACCGCTAATC AACCACTAAC ACCACCGAGA
CTACCTGAAT TAGTATTAAC AGTTTCATTA ATGTTCTTGC TAACTCCAAT GCCATGGTGG
AGTACCAACA ATCAATGGCA AACGCCGGCA CTACGCTTCT TGAATACGCT AAACTCTGTG
CAACAGGCTC AATTAACAGT TTTAGTACCA CTTGACATCG CTGGACTTTA TATCCCAACG
TTCAAAAAGA TAATAGCATA CCATGAAAAC GGTAATGAAT TCATGGATCT CCTTGACGAA
GCCGACATCG TTATTATCGG TAGTGTATCT TCCAGCGAGA CGATCCGGCG CTTTACCAGT
TATCCGCAAC AGTACGGCTT CGAACCTATA CTTGAACCGT ACATCCCAAG TTTATTTATA
CGATCAGAGC ATCGTCAACT ATTCTCAACG AAATGA
 
Protein sequence
MHTSQSMRGK KLFIDAAIIC FLIMAWFKLC MGIESSLEVD TAADETLYLY SGITQTTPAD 
YGPLYGLWYW LLWWTTPDRI DLYYLNWRMT TLLPVLAFYL ICRLNRVTPL VSAVAAWLLL
ISSINVSTWP RVSHLALFIV LLSLSAISLL RSRSRSSLAI ATGLLAASYV RPELFLSYVA
GLGVVLIDLI RDYRQKQLLP WLTMIITGFV QLALLIWQGV PMTGERSFVA FSQHFATSWI
TWNNSQLDPW NDFPYIMQTA FGDDVDTVWE AFLANPMLIL RHMVQNVIRL YGIATLLPVG
VVHYSATADR IDKLFLWVVL FTALGVFLMT LHLIRRSISE LLFRKEHNRL RVWIIICTLL
VFISIISIYP RPHYLLLLIL PLLFFVIVVY TANQPLTPPR LPELVLTVSL MFLLTPMPWW
STNNQWQTPA LRFLNTLNSV QQAQLTVLVP LDIAGLYIPT FKKIIAYHEN GNEFMDLLDE
ADIVIIGSVS SSETIRRFTS YPQQYGFEPI LEPYIPSLFI RSEHRQLFST K