Gene Cagg_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1472 
Symbol 
ID7269306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1806095 
End bp1808992 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content59% 
IMG OID643566314 
Producthypothetical protein 
Protein accessionYP_002462813 
Protein GI219848380 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00440571 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGCGC CGGCGTTGCG GACTCAGCAG GAACGAGTCC ATGCCTGCTT GCGCTCCGCT 
ATTGAGCGAC CGGCGTTGCT CGAAGCTGTC TGCACGATGC TGCGCGAACA ACATGGCGTG
ATTGCTCTCG ACGCACCAAT GGGGAGCGGA GCGACGACGT TGCTCGCCCA GTTGGCAGTG
CGCGCTGAGT GGCCACTCTG GCTGGCCGAT GATGATGATG GCGGCGGGGC ACTGGCATTC
TACGCCCAAA TCGCCGCACT ACGCCGCCCT TCTTTGCCTT TGGTCGATCC TGCTGCATTA
ACCGACCCTG CCACTTTTGA ACGGCTTTTG GCCGAAGTCG TCGATCCGAA TCGACCACTG
GTGCTACTGG TCAGCGCGCC TAACCGTGAT CGACAGCCCT TGCGCTCACT GCCGTTACCA
CTACCGCTCG ATCTGCCTGC CGGTGTGACA CTGCTCGTCC ACGGTTCATT GCCAATCGAG
CCTGATGCGC GAATTGTCTT GCCAAAGGCC GATAGTGCCC TTTTCCAAAC CCAAGCGTCG
TTGCTCGAAC GCCGTAATTG CCCACCAGGC TGGCGCAAAC CGCTTGTTTT GGCAGCCCGC
GGTAATCTGC TCTATCTGAG TTGGGCTGAA CGCTGGCTCC ACCTTGGTTT GCTTGATGTG
GCAAATTTGC CACCCGATCT CGATCATCTG TTGCAGCAGT GGTGGCAGTC GCTTAGTCGA
ACCGAGCAAC GATTGGCCGT TTTGTTGGCC GCCGCGGGTG AACCGTTGCC GATGACGGTG
TTAGCCGAAG TTAGTGCCGA ACACCCACAT CTCATTCTCG ACCGCTGGGA AGAACAGGGT
CTCGTCCATA TCAATCTACG CCGACTGAGT GAGGATGATA CGATTTTGCT CGTGCGCTAT
GCGCATCGCG CGGTGCGCTT GTTTTTGGCC CGTCACGCTG CGCACGAGAT GAATGCTGCA
CACGGAGAGC TGGCCCGTTG GTACGCCGAA CGGCTCAAAC AAAACCCGCT CGATTTGACG
AACCGTTATC TAGGACGCCA ATTGGCCCGC CACACAGCGT TATGTCCTCC TGCCCAGCGT
CCGGCTCATT TGCCTACGGC CAACCCAACA ACTTGGTTGC GTGAACGAGA ATTGCGTGAA
GGAATAGCCG GCGCGCTGCG GGATGCCGGT TGGATGCTGT ACGATGCCGC AGCCGGTTCG
CCGTTGGATT TAGCGCGGAT TGCTGCCATC ACCGGTACAC TTGCTACCCG CGCGCGGCAA
CTTACCGGCG ATGTTGTGGT TGCTGCCTTT CTCACCGCTG TTCAAACCGG TGGACGTGAA
GGGAGTTTGC GCCGGGTAAC GGCGATCGTT GAGCAGTTGC CCGATGGTGT CCCAAAAGCG
GCTGTGTTGC GCCAGCTCGG TGAGGCGTGT TATAGCGTTA ATATGCGTAG CGCCGCGATG
CGTCTCCTTT CTCGGGCACT TGACCTCGAA GCACAGCCGG TTTCGCGGGC TTGGCGTGAC
GTTCGTGATC AAGCTATCGA GGCGCTGGCT ACTGCTTGCT TGATAGCCGG TGATGTTGAT
CGGGCTTTAG CCTGTGCGGA GTTGATCGAT CTACTCGAAC GCCGTGCCCA GGTTGAGACG
TTGGTGATAC GACGCTTACT TGAAGATGGT CAGTACGACC GGGCATGGCG TTTGTCACGC
TCCATTCTGC ACGAAAATCG GGCTGCGTGG GCGCAGGCCG AGGTGGCGGT TGCCTTAGAA
CGGATCGGTG ATCCGCGCGG TGCGATGATG TTGGACGAGC TGAAGGTAGA GACTGCACGC
GCCTGGGCCG AGATTGAGCT GGCTTGCGAG GTGGCCTTGC GTGATGAAGA GGCTGCGTTG
CGCCGAATTA TGGCGTTACC CGGCCAACAT CAGCGTGATC GCGGTTTAGC TCGCCTGGCC
CGCGTCTTTG CACACGCCGA AAAGGATGGT GATGCATTGG CAGCGGCTGA GCGGATCAGC
AATCGTGAGT TGCGTGTGAC GACGTTGCTC GAGTTGCGCG TGTTGTTGCA AGGGTTGGTT
GCTAATCTTG CTACCGAACG AGCAACGCGC GAGATAGATG CCCTCCAAGG CGAGGATCGT
CCGATATTGT TGGCTGCTCT TGCTTCGGCC CATGCAGCGA TTGGTCGTAA GGATCGGGCA
TTGGCGATAG CCAATCAGTT ACGCGGGGAA GAACTGGAAC GGGCCTTGTC GCGGGTTGCG
GTTGCTTGTG TACAGGCAGG TGATTATGCC GGAGCGCAGG CTGTGTTGGC CCAGATGACC
GATGACGATG AGCGAGATTG GGCACGCGAT GAGATTGCGC GCACGTTGGC TTCTATCGGT
GATTGGGAAT CGGCAATGGC GCAGGCAATG GCGATTGTTG CTGCCGATCA GCGTGCGCGT
ACTAGCGCCG ATTTGGCCAT TACTCGCGCG CGTTCCGGTG ATGTACTCAC CGCTGTATCA
ATGATTCGTG CGATCGAGGT GCCTGCTGAG CGGGGACGTG CCTTAGTGCT GATCGCACCG
TTGTTAGCAA CGACCGATGC CACGCTGGCC GACCAACTGG CCGATGAGCT GCTGATCGGT
GAGGTACGTA GCCGGTATCG TGCAGCCCTG GTGGTAGCCC TCGCCGAACG CGGTGAGTTG
GCGACTGCGG CTAAGATCGC CCGTCGCATC CGCCGCCGGA ATGAGCGGGT ACGCGCCGAA
CTGGCAATTA TTGTGGCCCT TGATCCTACC GATCCCATGA CCTTGGCGCG CTTGGCAACA
ACATTGGCAA AGGCCGCGGT GGGGCGTGAA GAGATGTTTC ATGCACTTGA GCTGGTCATC
CCTCTCTTGC AACGAATCGG TGGGACCCCG TTGCTGGCCG ATCTGGCGAC GGCGATCGTT
GCCGATGATC GGGCGTAG
 
Protein sequence
MVAPALRTQQ ERVHACLRSA IERPALLEAV CTMLREQHGV IALDAPMGSG ATTLLAQLAV 
RAEWPLWLAD DDDGGGALAF YAQIAALRRP SLPLVDPAAL TDPATFERLL AEVVDPNRPL
VLLVSAPNRD RQPLRSLPLP LPLDLPAGVT LLVHGSLPIE PDARIVLPKA DSALFQTQAS
LLERRNCPPG WRKPLVLAAR GNLLYLSWAE RWLHLGLLDV ANLPPDLDHL LQQWWQSLSR
TEQRLAVLLA AAGEPLPMTV LAEVSAEHPH LILDRWEEQG LVHINLRRLS EDDTILLVRY
AHRAVRLFLA RHAAHEMNAA HGELARWYAE RLKQNPLDLT NRYLGRQLAR HTALCPPAQR
PAHLPTANPT TWLRERELRE GIAGALRDAG WMLYDAAAGS PLDLARIAAI TGTLATRARQ
LTGDVVVAAF LTAVQTGGRE GSLRRVTAIV EQLPDGVPKA AVLRQLGEAC YSVNMRSAAM
RLLSRALDLE AQPVSRAWRD VRDQAIEALA TACLIAGDVD RALACAELID LLERRAQVET
LVIRRLLEDG QYDRAWRLSR SILHENRAAW AQAEVAVALE RIGDPRGAMM LDELKVETAR
AWAEIELACE VALRDEEAAL RRIMALPGQH QRDRGLARLA RVFAHAEKDG DALAAAERIS
NRELRVTTLL ELRVLLQGLV ANLATERATR EIDALQGEDR PILLAALASA HAAIGRKDRA
LAIANQLRGE ELERALSRVA VACVQAGDYA GAQAVLAQMT DDDERDWARD EIARTLASIG
DWESAMAQAM AIVAADQRAR TSADLAITRA RSGDVLTAVS MIRAIEVPAE RGRALVLIAP
LLATTDATLA DQLADELLIG EVRSRYRAAL VVALAERGEL ATAAKIARRI RRRNERVRAE
LAIIVALDPT DPMTLARLAT TLAKAAVGRE EMFHALELVI PLLQRIGGTP LLADLATAIV
ADDRA