Gene Cagg_0290 details

Gene Information

Locus tag: Cagg_0290
Symbol:
ID: 7267471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 359975
End bp: 361261
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 57%
IMG OID: 643565159
Product: protein of unknown function DUF58
Protein accession: YP_002461673
Protein GI: 219847240
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
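
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 359975
end_bp = 361261
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is
# assumed to be a stop codon and is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

With these values the span is 1287 bp, which is 429 codons, or 428 amino acids plus one stop codon.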


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACAA CCCGACCGAT CTTTATCCTA CTGCTTGCCG GTACAAGTTA CCTCGCCGCC 
CAAACAACCG GTCAGCGTCT CTTTTTTCAT CTCAGCTACA TTCTGGTCAG TCTGCCCCTC
GTGGCGCTGA TCTGGACGTG GCTTAACTTG CGTGGTCTGC GCATCGAGCG TGCCCATACC
TCCTTGCGGG CCAGCGTCGG TGAATACATC CGCGAACGGA TTACCATTCA TAACCGATGG
TGGTTACCCA AACTGTGGAT CGAACTCCAC GATGAGTCTG ATCTGCCGCA GCACGAACCC
GGTTTCGTAA CCTACCTGGC AGGCCGCGAA TCAACCCGTT GGACAACCCG CTCGCTCTGT
ACCCAACGTG GCCGGTTTCG CCTTGGTCCA ACCCGCGTCA TCAGTAGCGA CCCATTTGGC
CTCTTCCGCT TCTCACGCCT GATCCCCGGT AGTGGTGAAC TTATCGTCTA TCCCGCCTCT
GAAATCATCG CTACATTCCG CCTCCCCTCT GCCGAACGGT CCGGCGGTGC GAGCAATTTG
GTGCGCGTCC ACAGTGTTAC CCCCAACGTT GCCACCATCC GCGATTACCA ACCCGGCGAT
GGCTTCAACC GCATTCACTG GCGCAGCACG GCTCGTTACA ACCGTCTCAT GGTCAAAGAA
TTTGAGCTTG ATCCGGCGGC AGACATCTAT CTCATTCTCG ACCTTAATGA ACAGGCAGTC
ACGCGGATCG ACGAACCGGC GCTGCTCGCT CATGAACGGG CCGGTGTACC GTGGTGGCAG
CGCCAACCGA CAATCCACCG TCACGCCTCT CCCATCTCAA CTGAAGAGCA CGCCGTCACG
GTAGCGGCAT CGCTCGCGCG CACGCTGCTC AACCAAAACC GAATCGTTGG GCTTTTAGCG
TGGGGCGAAC GGCTGGAAGT CATCCCCGCC GAGCGTGAGG AACGTCAATT ATGGAAAATG
CTCGAACTAC TGGCCGTCTT ACGTGCGACC GGGCAACACA CCCTTGCCGA ACTCCTCATC
GCCGAAGGAC AGCGCTTCGG ACGCGATACA ACGCTGATTA TTATCACCTC TGATCTCGAT
CCCCGTTGGC TGGCAGCACT GCAACACCAC CTCTACCGCG GCACACGCGC CGTTGTTATC
TTCATCGATC CGCAGAGTTA CGGTGGCCGT TATGATCCGG CGCCTCTCCT CAACCACCTC
ATTGCCCTGC ATATCGATGT GTATCGTCTC CAACGAGGTG ATGCACTGGC CGATGCGTTA
CGGCAACCGA TCGTAGTGAC AAGGTAA
 
Protein sequence
MNTTRPIFIL LLAGTSYLAA QTTGQRLFFH LSYILVSLPL VALIWTWLNL RGLRIERAHT 
SLRASVGEYI RERITIHNRW WLPKLWIELH DESDLPQHEP GFVTYLAGRE STRWTTRSLC
TQRGRFRLGP TRVISSDPFG LFRFSRLIPG SGELIVYPAS EIIATFRLPS AERSGGASNL
VRVHSVTPNV ATIRDYQPGD GFNRIHWRST ARYNRLMVKE FELDPAADIY LILDLNEQAV
TRIDEPALLA HERAGVPWWQ RQPTIHRHAS PISTEEHAVT VAASLARTLL NQNRIVGLLA
WGERLEVIPA EREERQLWKM LELLAVLRAT GQHTLAELLI AEGQRFGRDT TLIIITSDLD
PRWLAALQHH LYRGTRAVVI FIDPQSYGGR YDPAPLLNHL IALHIDVYRL QRGDALADAL
RQPIVVTR
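
The relationship between the two sequences can be illustrated with a minimal translation sketch. It is not the database's own method. It builds the codon table from the standard compact amino-acid string, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons. The function names `clean` and `translate` are hypothetical helpers, and only the first and last rows of the gene sequence are used as input.

```python
# Compact codon table: bases in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence shown above.
# The last row starts at base 1261, which is codon-aligned (1260 / 3 = 420).
first_row = "ATGAATACAA CCCGACCGAT CTTTATCCTA"
last_row = "CGGCAACCGA TCGTAGTGAC AAGGTAA"

print(translate(first_row))  # MNTTRPIFIL
print(translate(last_row))   # RQPIVVTR
```

The two outputs match the first ten and last eight residues of the protein sequence, and the final TAA codon accounts for the stop that is not counted in the 428 aa protein length.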