Gene Cagg_1035 details

Gene Information

Locus tag: Cagg_1035
Symbol:
ID: 7268407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1279894
End bp: 1282926
Gene Length: 3033 bp
Protein Length: 1010 aa
Translation table: 11
GC content: 63%
IMG OID: 643565880
Product: hypothetical protein
Protein accession: YP_002462385
Protein GI: 219847952
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID: [TIGR01376] Chlamydial polymorphic outer membrane protein repeat
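
The Start bp, End bp, Gene Length, and Protein Length values above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1279894
END_BP = 1282926


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues in a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3033
print(protein_length(length_bp))  # 1010
```

Both results agree with the Gene Length (3033 bp) and Protein Length (1010 aa) fields listed in the record.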


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCCA AACATCCGTT GGTAATCGCT AGTGCGATGA TCGTGCTTTG TCTATCCATC 
ATCGGCTACC ATACGACGCC GGCACAAGCG ACGGGCATTT TGTATGTGGC ACCGACCGCA
GTAGGCAGTG GCGATTGTTC GTCATGGGCA AATGCATGTA CGTTACAGAC CGCGCTCACC
AATGCAGTCG CCGGTCAGCA GATCTGGGTA CAGGCAGGTG TCTATAAACC AGGAAGTACA
CGGAGCGATA GTTTTACGCT GAAGAGCAAT GTGGCCATTT ATGGAGGATT TGCCGGTATC
GAGACAAGTC TGAGTCAACG TAATACATCC AATATCACCG TTCTGAGCGG TGATATTGAT
AACAACGATA GTATTGACAC CGGCGGTGTG ACGACGGTCA TCAATGGGGC GAACAGCTAT
CATGTTGTCA CAGCGTTTGG AGTAACCGGC GCCGTTCTTG ACGGATTTGT AATTACCGGT
GGCCAGGCGA ATGGAGGTGG CTCTGACGAC AAGGGGGGTG GGCTGTACAA CTGGAATAGC
AGCTTTTCGC TGAAAAACAT TACCTTCGCC GGTAATTATG CCTCGAGCGG TGGTGGGATG
TACAATGATC ACAGTCTTTG CGGACTAGAG CTAGAGACTG TTACGTTCAT GGACAACACT
GCGCTCTATT TTGGCGGAGG AATGTTTAAT GAACAGATCA GTACCAGCTG TACCACCACC
ACTAATCCAA CTCTGAAGAA TGTGACCTTC GCCAATAATA GTGCCGATTA TGGCGGTGGG
ATGTTTAATT CAAATAGCAG TCCAGAGATC ATCCACGTCA CCTTTAGCAA TAATCAGGCA
AACCGGAAGG GTGGCGGAAT GTATAACAAC TCGGGCAGTA ATCCTGAGGT ACGGCGGGTG
ATTATCTGGG GCAATAGCGC TCAGACTCAG GCTGGAGTAA GTAATGTATC CGGAGCAACA
CCCAGGTTTA CCCACAGTAT TGTTGAAGGA TGCATAAATC CGATTAGTTT GAACTGGATA
AGTGCTTGCG GCACCAATAA TGGGGGCAAT CTTGCCGCTA ATCCGAATCT CGATTCGCTC
ACCGATTTTG GTGGGCCAGG GCGTCAACTC TTCCCCTTGC TACCAGGTTC TGCCGCGATT
GACGCCATAA CCGGTTACTG CCCTTCGACT AGCGATCAGC GCGGTGTCGT TCGTCCGGTT
GATGGTGATG GTGATAGTGT AGCCCGCTGT GACATAGGTG CGTTTGAGTT TACGGCATCT
GTGCCAACTC CAACCCCGGC GGTGACGGCG ACGGTCACCC CGACAGCCAC CCCGACAGAC
ACGCCGACAG CCACCCCGAC AGACACGCCG ACGGCGACGC CAACAGACAC GCCGACGGCA
ACAGATACCC CGACGGTGAC GGCGACAGCC ACACCGACAG ACACCCCGAC AGCCACCCCG
ACAGACACCC CGACGGCGAC GCCAACAGAC ACGCCGACGG TAACGGCGAC GGCCACCCCG
ACGGCCACGC CGACGGCAAC AGATACCCCG ACGGTGACGG CGACAGCCAC ACCGACAGAC
ACGCCGACGG CGACGCCAAC AGACACGCCG ACGGCGACGC CAACAGACAC GCCGACGGCG
ACAGACACCC CGACGGCGAC GGCGACGCCA ACGGCGACAG ACACGCCGAC GACAACGGCC
ACGCCGACGG CCAGCCCGAC GGTAACGGCC ACGCCGACGG CCACGCCAAC TGACACGCCG
ACGGTGACGG CGACGGCCAC GCCAACGGCC ACGGCCACGC CGACGGCGAC AGACACCCCG
ACGGTAACGG CAACGGCAAC GGCCACGCCG ACGGCCACGC CAACAGACAC GCCGACGGCG
ACAGACACCC CGACGGCGAC GGCGACGCCA ACGGCGACAG ACACGCCGAC GGTAACGGCA
ACGGCAACGG CGACGCCGAC GGCCACGCCA ACTGACACGC CGACGGTGAC GGCGACAGCC
ACGCCAACGG CCACGGCCAC GCCGACGGCG ACAGACACGC CGACGGTAAC GGCCAGCCCG
ACGGCAACGG CGACGCCGAT GGCGACAGAC ACCCCGACGC CAACGGCCAC GCCGACGGCG
ACATACACCC CGACGGTAAT GGCAACGCCA ACGGCCACGC CGACGGCCAC GCCAACTGAC
ACGCCGACGG TGACGGCGAC AGCCACGCCG ACGGCCACGC CAACTGACAC GCCGACGGTA
ACGGCAACGG CCACGCCGAC GGCGACAGAC ACGCCGACGA CAACGGCCAC GCCGACGGCG
ACAGACACCC CGACGGCGAC GGCGACGCCA ACGGCGACAG CCACGCCAAC GGCCACGGCC
ACGCCGACGG CGACAGACAC CCCGACGGCG ACGGCGACGC CAACGGCGAC AGCCACGCCA
ACGGCCACGG CCACGCCGAC GGCCAGCCCG ACGGTAACGG CCACGCCGAC AACAACGGCC
ACGCCGACAG CGACAGACAC GCCGACGACA ACGGCCACGC CGACGGCGAC AGACACGCCG
ACGGTAACGG CAACGGCCAG CCCGACGGCA ACGGCGACGC CGACAACAAC GGCCACGCCG
ACGACAACGG CCAGCCCGAC GGCAACGGCC ACGCCGACGG CGACAGCCAC CCCGACGGCA
ACGGCGATGG CCACGCCGAC GGCAACAAAT ACGCCGACGG TGACGGCGAC GGCCACGCCG
ACGGCGACGG CCACGCCAAC AGACACGCCG ACGGCAACGG CGATGGCCAC GCCGACGGCA
ACGGCGACGC CGACAACAAC GGCCACGCCG ACGGCGACGG CCACGCCAAC AGACACGCCG
ACGACAACGG CCACGCCGAC GGCGACGGCG ACAGCCACCC CGACGGCAGC GGCGACGGCC
ACGCCGACGG CAGCGGTGAC CGCAACGACC TTACCGGCGC CGTCCTACCG CATCTACAAT
ACCGATTGTG ATGGAAATGA AGACTGTCTT GCCGGCACCG CCGTTCTACC GCACCTATAT
TCCGATAACA GCGCGGTAAG CAATCCGGCG TAG
 
Protein sequence
MNAKHPLVIA SAMIVLCLSI IGYHTTPAQA TGILYVAPTA VGSGDCSSWA NACTLQTALT 
NAVAGQQIWV QAGVYKPGST RSDSFTLKSN VAIYGGFAGI ETSLSQRNTS NITVLSGDID
NNDSIDTGGV TTVINGANSY HVVTAFGVTG AVLDGFVITG GQANGGGSDD KGGGLYNWNS
SFSLKNITFA GNYASSGGGM YNDHSLCGLE LETVTFMDNT ALYFGGGMFN EQISTSCTTT
TNPTLKNVTF ANNSADYGGG MFNSNSSPEI IHVTFSNNQA NRKGGGMYNN SGSNPEVRRV
IIWGNSAQTQ AGVSNVSGAT PRFTHSIVEG CINPISLNWI SACGTNNGGN LAANPNLDSL
TDFGGPGRQL FPLLPGSAAI DAITGYCPST SDQRGVVRPV DGDGDSVARC DIGAFEFTAS
VPTPTPAVTA TVTPTATPTD TPTATPTDTP TATPTDTPTA TDTPTVTATA TPTDTPTATP
TDTPTATPTD TPTVTATATP TATPTATDTP TVTATATPTD TPTATPTDTP TATPTDTPTA
TDTPTATATP TATDTPTTTA TPTASPTVTA TPTATPTDTP TVTATATPTA TATPTATDTP
TVTATATATP TATPTDTPTA TDTPTATATP TATDTPTVTA TATATPTATP TDTPTVTATA
TPTATATPTA TDTPTVTASP TATATPMATD TPTPTATPTA TYTPTVMATP TATPTATPTD
TPTVTATATP TATPTDTPTV TATATPTATD TPTTTATPTA TDTPTATATP TATATPTATA
TPTATDTPTA TATPTATATP TATATPTASP TVTATPTTTA TPTATDTPTT TATPTATDTP
TVTATASPTA TATPTTTATP TTTASPTATA TPTATATPTA TAMATPTATN TPTVTATATP
TATATPTDTP TATAMATPTA TATPTTTATP TATATPTDTP TTTATPTATA TATPTAAATA
TPTAAVTATT LPAPSYRIYN TDCDGNEDCL AGTAVLPHLY SDNSAVSNPA
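
The sequences above are displayed in blocks of 10 characters separated by spaces and line breaks, so they need to be joined before any length or composition check. The sketch below shows one way to do that in Python; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample string is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First display line of the gene sequence from this record.
first_line = "ATGAATGCCA AACATCCGTT GGTAATCGCT AGTGCGATGA TCGTGCTTTG TCTATCCATC"
seq = clean_sequence(first_line)
print(len(seq))             # 60
print(seq.startswith("ATG"))  # True
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field, and `gc_fraction` with the GC content field.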