Gene Cagg_0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0036 
Symbol 
ID7269033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp58214 
End bp59548 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content55% 
IMG OID643564909 
Productprotein of unknown function DUF58 
Protein accessionYP_002461425 
Protein GI219846992 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.764731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACCAA CACCACGCCT GCTATTCCTT GTCTTACTCG CTGCCCCCCT TATCGCCGGT 
ACCGCATTTG CGCCATGGCT GAGCTGGGCA GCAGCTATCT ACTTGATCCT CGTAGGAGGG
GTGGTGATAA GCGATATGCT CCTCTCACCT GATCCCAAGG ATATCGAAGT CGAACGCTTG
TGTGAGAAGC GACTATCCCT TGGGGCAGCG AACCTCGTCA CAATCCTCAT TACCAACGCT
TCAGCACGTG TGTTGCGGTT TGAACTGCGT GATGAATATC CGGTAGAGAT ACCGGTCGAT
ACCGACCGGC TCAGCGGTGT TGCCGAGCCG TTTGGTGTGT GTGCGGTTCG TTACCACTTA
CGGCCACAGC GGCGTGGTGA TTATCACTTT GGCGACATCG TCATGCGCTA CGACGGTGTG
CTTGGGTGCC ACCGTCGCCA GGTACGCTTT GCCGCAGCCC GTACCGTGCA GGTGTATCCC
AATCTGCTCG CAGCACGTAA ATACGATCTC CTTATCCGTC GTGGTCAGTT GCGTACTATC
GGGATACGTT CAATTCGCCA ACTCGGTAAG GGGGGCGAGT TTGAGCATCT GCGTGAATAT
ACGCCGGACG ATGAATACCG CCGGATCAAT TGGAAGGCGA CGGCTCGGCG CGGAAAACCG
ATTGTTGCCG AAATCGAAGC GGAACGGAGT CAGCAGATTA TCTGTGTGAT CGACGCCGGA
CGGCTGATGG CAACGCCGGT GGCCGATCCA CTCCAACCCG ATGATCCCGG TCTTACTCGG
CTCGATTATG TCGTTAATAC GGCATTAATG CTGAGTTATG TCGTCATTGG TAAAGGCGAT
CAAGCCGGTA TGCTTACCTT TGCCGGTACC GTCGAGAACT TTATCCCACC ACGCAAGGGA
AAGGCTCAAT TCCAACGCTT GCTTGAGGCA TTGTACAATG TACAGGCACA GCCGGTTGAA
GCCGACATTG CTGCGGCCTT GGCTTATCTC GATCAGCGGC AGTCACGGCG CGCACTGATC
GTTATCTTTA CCGATATTAC CAATCCGGCA GCCGTACAAC CGTTGATCGG TCTTCTCCAA
CGGCTCGCAC GCCACCACTT GCCGCTCTGT GTGACGATTA GCGATCCGAA TATCGTTAAT
GTTGCCGGTC GTCCGGTTAC TGATAGCCAT GGGCTGTTTC GCCGTTTGGT CGCCGAACAG
TTGGCCAATG AGCGCCGGGC TTTACTCGAT CAGATTCAAC GCAGTGGTGC GCTAACGCTC
GATGTGCCCG CAACTTCCCT GACGGTAGCG GTGGTAAACA CCTATTTGCG CTTGAAAGAA
GAGGCTCGGC TGTAA
 
Protein sequence
MLPTPRLLFL VLLAAPLIAG TAFAPWLSWA AAIYLILVGG VVISDMLLSP DPKDIEVERL 
CEKRLSLGAA NLVTILITNA SARVLRFELR DEYPVEIPVD TDRLSGVAEP FGVCAVRYHL
RPQRRGDYHF GDIVMRYDGV LGCHRRQVRF AAARTVQVYP NLLAARKYDL LIRRGQLRTI
GIRSIRQLGK GGEFEHLREY TPDDEYRRIN WKATARRGKP IVAEIEAERS QQIICVIDAG
RLMATPVADP LQPDDPGLTR LDYVVNTALM LSYVVIGKGD QAGMLTFAGT VENFIPPRKG
KAQFQRLLEA LYNVQAQPVE ADIAAALAYL DQRQSRRALI VIFTDITNPA AVQPLIGLLQ
RLARHHLPLC VTISDPNIVN VAGRPVTDSH GLFRRLVAEQ LANERRALLD QIQRSGALTL
DVPATSLTVA VVNTYLRLKE EARL