Gene Cagg_1315 details

Gene Information

Locus tag: Cagg_1315
Symbol:
ID: 7268606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1619269
End bp: 1622520
Gene Length: 3252 bp
Protein Length: 1083 aa
Translation table: 11
GC content: 45%
IMG OID: 643566158
Product: hypothetical protein
Protein accession: YP_002462659
Protein GI: 219848226
COG category:
COG ID:
TIGRFAM ID:
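
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The helper names `span_length` and `coding_codons` are hypothetical and not part of the record.

```python
# Values copied from the record; the checks themselves are an illustration.
START_BP = 1619269
END_BP = 1622520
GENE_LENGTH_BP = 3252
PROTEIN_LENGTH_AA = 1083


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed in this record.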


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0321866
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000474405
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAATGTAT TGTATCGATC AATTGACCAA GAGCGTTCGC TGTACTCGTT GCAACCTACA 
ATAATAGTAG GGTTGGGCGA GATTGGCTGG TTGATCCGTG ATGAATGGGA ACGTCGTTAC
AAAACGTACA AAGAGTTTTA TTCACCGCTC ATTACAGATG TAACCGATGT TTGCTTCTGG
ATGCTCCCTT CCGATGTCAG TGAGTTGCCT GATTGGGAAA AATTGTCATT TGAAGAGCAT
CGGCAACGTT TTCTCACTCG CGAAACACTC ACCGATCATG CTCAACAGAT TGCGTCACAT
TATCACAAGG TGCGCGCACA TATTAAGTTG TCACTAACCG ATCATGTCCG ACCAACCGTC
TTCGTTATTG GTGCAACGTG GTCACCGGAA GGTCGCGCAT TGCTGTGGCC ATTAGCGTAT
CTGATCCGCT TCCTGTTTGG TGATCCGTAT GATTATGAGC TGGTCGGAGT ATTTGTTACT
GCACAGTGGG AGGATGGTAA ACGCAAGGCA CTGGAACGAG ATGCCCTGAC ATATGAATTG
TTGGAGGAAG GTAACAGGCT TGCACAATCA CCAGATTGGC AGCAAACATT TGCACGTGTA
TTTGCATGCG ATCTTATTCC AGAGAAAATT AGCTTTGATA AAGTATTTTT GATTGATAAT
GTAAAGCAAA ATAATGCAAC AAGTCCGGCA ACCTATAATC CAGCGGAAAT AGTTCAGCTT
ATCAGTGATA TGATCGAAGC ATGTTTGCAT AGTGATTTGC TGTCAATAAT TGATCGTTCA
TATACTGCTG ATTGCCCGAC TCATCATCCG TTATACATTG GAATAGGTAT TTCATCGCTT
GTTGTACCGT TGGTCAGGCT ATATGATAAT CTGACCAACA CTATTGTTGC CAACTTAGTG
CGTGTCCGTT TGCTGACAGA GGCTGAGCAA TTGAGATGGG ACGATCTCAA AAAAGAAATT
CAGCAGAAAA TTCAGCAGGA GTTGAACGTA GCGGTTCTAA ACGCCGTTAG GAGACGGTAT
CAATCGGATG TAGCAGAAAA TCAAAACCGT CTTGAGTTAA CAAAAGAATA TGAAATTGGG
CAGACGAAGC AGAAAGGAAA AATCGTTGTA ACCATAGTGA ATAACGTTGA CCGATTGCAA
AGTCTCAGTC GTATCCGTAT ACCGCAGGTA TTTGTTCGGT GGTATGGGCG GTTCTCTTCA
CGCCGTCTAT TGCAAAGTCT CAGTCGTATC CGTATACCGC AGGTATCGTT GTATATTCGC
GGGCCAGACC TGCTGAGTCC TGACGAGTCG TTACAAACAA TCGACGAACG AATCGACAGT
GAAAATAGCC AGATAGATAG TTTTCTTAGA CAGTTTCGTG AAACTATTCG TGATTATAAA
GACTGTTTAC GTTCAATAGA GAAAATATGT GATGAAGCTA TTCAGACTCA ATTACGTACT
GGTGAGCAAG GATTGGTCAG AGCAATAAGA TTAACGGAAA GCATCATTGA GCAGGTCAGA
AACGCCGTTA TTGAAACCAA GAAAGAGCTA TCCATACAAG AAGATGTGCT TAATCGTCTC
AAGAAGGGTT GGCGCTCGGT ACAGCGAGTT TCACCGCTAG AACGAGTAAA GAACGTAACA
CCGTTCTTTC CTCGTCCGCA AGCGCTTCTG CTACGTGTCT TCTTGATTGG CATAGTGTGT
TATCAGTTCT ACTACGATGG TTTATTACAA GGCATTTACA GACCGCCATT AGACTTACTG
CGGGCAATGT TGGGTGAGGG ATCAGAGGCA GCAATTCGCA GTACAGCGGT GGGTTTGATC
ATTCTGTGTC TAGGCGGACT AGCAATCATC ACATTGTTGC CATGGTTCGT AGTTCGTCTA
TACTTATTCT GGGCGAGGCA ATACGTTTGT AGAGAACAGC AGACCCAACT AGTGGAAGCA
CTGATTCTAA TGCGACTAAA AGTGCTTACT CGACTGCAAG GTAGGCTCGA ATATTACTTG
AAAGCGGAGC TTAAACGCTT GCATGCTGAG CTAGAGCAAC AACATGCACG TTTCGCAGAG
GAGCCGCCGG AGGAGACGTA TCAAGAGTAT TCTGTTGTTG ACCTTAAGGA GTTAATTCAA
TCCTTCCACT ATGATGTAGA AGCTGCAATC CAACGGCATC GACGACCACT AGTTACTTCT
TGGCTTCACG ACCATGCTGA TCTGAACAAT GTGTGGCCTA CTCTAATCAC GAAAGAAGAA
GTGTTTGATA ATGTGAGAAG ACAGATAGGC GATGTAGTAA AAGATCATCA TATTCGCCCA
ATAAGTACGT ATCTTGCTAC TCAGAACCTT GAAGAATGGG CAATGCGGAT CAGTAAATCG
AGTGTACCAT GGGTGAAGAT TGTGCAGCCG ATTTTTACCT TGGCTGATGA TGCTCAAAAC
CTACCGATAG AAATGATGTG CCTACTAGCT TCAGAAGGTC GTAGGAACCT GATTGCGCTG
CAGATTGTAC AGAATTTAGG TATGTGTGAG CTACTTAGTT GGGCTGATCC ATACCGAATC
ATGTTGGTTC GAATTATAGG CGGTTTTAAG CCAGCTCAGC TTACCCGCTG GTCAGAGATG
CACAACCGTT GGATGTTTGT TTCGCAAGCG GCAACCCAAA CTGTCACACC GTCGGTCTTG
TCGCAGCTAG CGAGTCTGGA GCAGGGTGGT TTAGCTCAAT CGGCAATGCC GGATCAAACT
GGCCAAGATG CTCAAGCGAT AGCTGGAGAC TTTGCTCATT TGGCGGTCTC ACAGCCAACC
GATGGCGTTA CAGGAGATGA GAAGATGAAC GATAGCTTTG ATAAAGTGGG ACACAATATC
CAAAATCTTC TCGATGCTGC TCGTAGTAAC GTGGCAGATC GATGGATTGA GCTGGAATTA
GAACAAGTAG AAGAGCAAAT ACGACCGCTT TTACAACAAA GTCTTACTCC AAACCCAGAG
GATGTAGTGA AAGCTTTAAA GGGGCTTTGC TATGTTTACG ATAATCCAGC GCTCGCCGAT
AGGGTGCGTC ATTGCTTGAG CGACATATTC CGATTAATTG ATGGTTGGTT GGCTCGTAAT
CACATTCGAC CACTAGTGCC TGAGCGAGGC GAACGCTACG ATCCACAAAT TCACGGTACG
GCGATTGGCG AAGAATCCGA TCAGAGTTTG CCAGGCGGTA CCATCAAGTG CCGTGTTCGA
CGCGGGTACA TTCAAGATAG TAATAATAAG GTACTACTTG AGCCACTCGT CATTGTTGTA
AAGGAGCCAT AA
 
Protein sequence
MNVLYRSIDQ ERSLYSLQPT IIVGLGEIGW LIRDEWERRY KTYKEFYSPL ITDVTDVCFW 
MLPSDVSELP DWEKLSFEEH RQRFLTRETL TDHAQQIASH YHKVRAHIKL SLTDHVRPTV
FVIGATWSPE GRALLWPLAY LIRFLFGDPY DYELVGVFVT AQWEDGKRKA LERDALTYEL
LEEGNRLAQS PDWQQTFARV FACDLIPEKI SFDKVFLIDN VKQNNATSPA TYNPAEIVQL
ISDMIEACLH SDLLSIIDRS YTADCPTHHP LYIGIGISSL VVPLVRLYDN LTNTIVANLV
RVRLLTEAEQ LRWDDLKKEI QQKIQQELNV AVLNAVRRRY QSDVAENQNR LELTKEYEIG
QTKQKGKIVV TIVNNVDRLQ SLSRIRIPQV FVRWYGRFSS RRLLQSLSRI RIPQVSLYIR
GPDLLSPDES LQTIDERIDS ENSQIDSFLR QFRETIRDYK DCLRSIEKIC DEAIQTQLRT
GEQGLVRAIR LTESIIEQVR NAVIETKKEL SIQEDVLNRL KKGWRSVQRV SPLERVKNVT
PFFPRPQALL LRVFLIGIVC YQFYYDGLLQ GIYRPPLDLL RAMLGEGSEA AIRSTAVGLI
ILCLGGLAII TLLPWFVVRL YLFWARQYVC REQQTQLVEA LILMRLKVLT RLQGRLEYYL
KAELKRLHAE LEQQHARFAE EPPEETYQEY SVVDLKELIQ SFHYDVEAAI QRHRRPLVTS
WLHDHADLNN VWPTLITKEE VFDNVRRQIG DVVKDHHIRP ISTYLATQNL EEWAMRISKS
SVPWVKIVQP IFTLADDAQN LPIEMMCLLA SEGRRNLIAL QIVQNLGMCE LLSWADPYRI
MLVRIIGGFK PAQLTRWSEM HNRWMFVSQA ATQTVTPSVL SQLASLEQGG LAQSAMPDQT
GQDAQAIAGD FAHLAVSQPT DGVTGDEKMN DSFDKVGHNI QNLLDAARSN VADRWIELEL
EQVEEQIRPL LQQSLTPNPE DVVKALKGLC YVYDNPALAD RVRHCLSDIF RLIDGWLARN
HIRPLVPERG ERYDPQIHGT AIGEESDQSL PGGTIKCRVR RGYIQDSNNK VLLEPLVIVV
KEP
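
The relationship between the two sequence blocks can be sketched in code. The example below is a minimal illustration, not the pipeline that produced this record. It assumes the standard codon assignments, which translation table 11 shares with the standard code for sense and stop codons (the two tables differ in permitted start codons). The function name `translate` and the short test fragments are chosen here for illustration. The fragments are the first 30 and last 12 bases of the gene sequence shown above.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence in this record.
    print(translate("ATGAATGTAT TGTATCGATC AATTGACCAA"))
    print(translate("AAGGAGCCAT AA"))
```

The space-separated 10-base groups used in the record are handled by stripping whitespace before translation, so a block can be pasted in as displayed.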