Gene Cagg_1596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1596 
Symbol 
ID7268162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1948958 
End bp1951039 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content56% 
IMG OID643566437 
Producthypothetical protein 
Protein accessionYP_002462933 
Protein GI219848500 
COG category[S] Function unknown 
COG ID[COG1306] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000327656 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTTGT CTGTGCGGCG AATCGCTCGC TTTTGGCTAT TCTGTTGTCT GATCGCAGGA 
GTATTGGTGA GCTGTACGAC ACAACCGGTT ATCCTAACCG GCGTCGTTAC CGATGCATAT
ACCGGTCAAC CGGTTGTTGG GGCGACGGTC ACAATCGGTG AGCAGACGCT TACCACCGAT
GATCAAGGTC GGTTTCAGAC AGAACGCTGG CGTCCAGAGG ATACCTTATC GCTCAAAGCA
CCGGCTTATG AGCCGCTCGA AATGCCATTG GTCGATCAGC CCAGAGTAGG TAAGAGCGGG
GTGCTGACGG TAACAATCAC AACCGCTCTG CGTCCGAATG TATTGAGCGG TACGATCACC
GATATGTATA CCGGTCAACC GGTGGGTGGG GCTGAGGTGC GGATCGCCGG TAATGACGGG
ATGAGTGCCG TCACCGATGA GAGCGGTCGG TATACGCTGA CGAATGTACC CGAAGCGTTT
ACGGTGACGG TCAGCGCTCC TCAGTATGCG CCTGTTACCG AGCAAGTGGC GCGGGTTACG
GTGTTCGATA TGAGGCTGCG TCCGAACACG CTGAGCGGTA TCGTTACTAA TAGTTACACC
GGTGAGGCGG TGGCCGGAGC GAAGGTCACG CTTGGTGCAA TTAGTGCGAC GACGGATGAC
ACCGGCCGGT ATCTGTTGCG TGACGTGCCG GCGGAAAGCG GCGAGATTAC GGTGACCGCC
GATGGGTTTG CTTCTGTGAC ACAGCCGTTT ACGCGCACGA CGACGCTCGA TGTCGCCCTT
CGGCCTGATA CTCTGTTTGG TCAACTGATC GACGCTACCA CCGGCAAGCC GGTGCCCAAC
GCAGCGATCA TCGCCACAGA GACGCTCACC TCGACCGCCG TTGCTTTTGT GCGGATTAAC
GACAGTGTTG AAGGCCGGTT TCGCCTGCCG AATCTGCCTG AACGTGGCTT TGTGCAGGTG
TTGGCGCCGG GTTACGCGAA GCGCGTGATT CCGATTGAAC CGGGGAAGAT GCCCGATCGG
ATTGAGCTTG AGCCGTTCTA CGTGCGTGGC ATCTACATCA CGGCGGCAGT GGCGTCGGTG
CCGAGTCTCG TTGACAAATT TCTCGACCTG ATCGATCGTA CCGAGTTGAA CACGCTGGTA
ATCGATATCA AGAGCGATTT ACGCGATGAT CTGGGCGTTG TCTATTACGA TTCACAAGTG
CCGTTAGCCC GCGAACTAGG TCTGAGTACA CCACGGGTCG ATTTTGCTTC CATCTTGGCG
AAGGCAAAAG AGCGTGGCAT CTATACTATT GCCCGCGTCC AACTCTTTTC GCACGATAAT
GCGCTCTCCG ATGCCCGCCC TGAGTGGTCA ATCCGCCTAC GTTCGACCGG TGAGGTGTAT
GCCGATTATC CGGGGCCGGG GATTCGGTAC GCATACCTCG ACCCGACCAA CCAGAATGTG
TGGGATTACA ACATTGCGTT GGCGGTTGAG GCGGCGCAAA TGGGTTTCGA TGAGATCAAT
TTCGATTATA TCCGCTTCCC CGATTGGTTT GGGCCTTATA GTGAATTCAG CGAAAAGCTG
CTCTTTAGTG AGCCGATTGA TCCGGTGACG AACCCCGGTC GAATGTACGA TGTGATCCTT
GAGTTTATGC AGCGGGCTCA CAGCGCAGTT AACGCTGCCG GTGCGTTTAT GTCGATCGAT
GTCTTTGGGC GGGTGGTCAA TGGCCCTTCA CTGACGATTG CCCAAGATAT GACCCGCATG
GGTGAATTTA CCGATTATAT CTGTCCAATG CCTTACCCGT CGTTGTGGTG GGGTGGATTG
GAGAACATTG CGGTGCCGGT GAAATTCCCC TACGAGACGA TTAAAATAGC CGTTCGCAAT
GGGGGTCGGC AGATGATGGC AAGCTATGCT AAGCAACGGC CGTGGCTGCA AGATCATACC
GATCCGTGGG TGCCGGTCGT GGTTGAGTAC GGCCCGGTGG AAGTGCGTGC CCAGATCGAT
GCCACCGAAG AGCAGCCAGA GGCGGCGAGT GGTTGGATTT TGTACGACTC GGCAAATATC
TATAAGGGTG CATTCAACGG TGCAGTGCGT CCGGCGCCAT AG
 
Protein sequence
MGLSVRRIAR FWLFCCLIAG VLVSCTTQPV ILTGVVTDAY TGQPVVGATV TIGEQTLTTD 
DQGRFQTERW RPEDTLSLKA PAYEPLEMPL VDQPRVGKSG VLTVTITTAL RPNVLSGTIT
DMYTGQPVGG AEVRIAGNDG MSAVTDESGR YTLTNVPEAF TVTVSAPQYA PVTEQVARVT
VFDMRLRPNT LSGIVTNSYT GEAVAGAKVT LGAISATTDD TGRYLLRDVP AESGEITVTA
DGFASVTQPF TRTTTLDVAL RPDTLFGQLI DATTGKPVPN AAIIATETLT STAVAFVRIN
DSVEGRFRLP NLPERGFVQV LAPGYAKRVI PIEPGKMPDR IELEPFYVRG IYITAAVASV
PSLVDKFLDL IDRTELNTLV IDIKSDLRDD LGVVYYDSQV PLARELGLST PRVDFASILA
KAKERGIYTI ARVQLFSHDN ALSDARPEWS IRLRSTGEVY ADYPGPGIRY AYLDPTNQNV
WDYNIALAVE AAQMGFDEIN FDYIRFPDWF GPYSEFSEKL LFSEPIDPVT NPGRMYDVIL
EFMQRAHSAV NAAGAFMSID VFGRVVNGPS LTIAQDMTRM GEFTDYICPM PYPSLWWGGL
ENIAVPVKFP YETIKIAVRN GGRQMMASYA KQRPWLQDHT DPWVPVVVEY GPVEVRAQID
ATEEQPEAAS GWILYDSANI YKGAFNGAVR PAP