Gene Cagg_0810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0810 
Symbol 
ID7268834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1010307 
End bp1011608 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content56% 
IMG OID643565660 
Productlaminin G 
Protein accessionYP_002462169 
Protein GI219847736 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.478954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCAC ACAACGGCTA CAATCTCTAC CAGCATACCG CTTTACTTGC CCTGGTGGGT 
CTGTTAGTCA TGGCCCTTTT CTTCAGATCA ACACGTTACG CGGCGCCGCT CCAAGCCCAA
GGGGGGTCAT CGCTGCGCTT CTATGGCAGC GACACCAATG ACCGCGACCG TGTAAAAATT
CCACTCGGTC AGATAGACTC TGCGGGCCGG CTTATCACAT CTCACCCGGT GAACGTTGGC
GATGCCTTCA CGCTTGAGTT TTGGATGAAG ACTGCGCTCG GCAATACCGC TCCGCCTTGC
CCTACCGGTT GGTACACCGG CAACATTATC ATCGACCGTG ATGTGTTTGG GGCCGGCGAT
TACGGTGATT ACGGCGTTGC TATCTGTAAT CAACGGTTGG TCGTAGGCAT AAGTGTGGGA
AGTGATGACC GACTGCTGAT TGGTAATACG GTCGTCACCG ATGGCCTCTG GCATCATATT
GCTATTGTCC GTGCCAACGA CGGTAAGGTA CGGCTGTTCG TCGATGGGCA ACTCGATGGC
ACGCTGAATG GGCCGGTAGG CCGCATCGAC TACCGACAAA ATCGTTCGAC GAGTTATCCC
ACGAGCGACC CTTATCTCGT ATTAGGAGCT GAAAAACACG ATTTCCCCGG TAGTCGATAC
TACGATGGAT GGATTGACGA TATGCGTATA TCGCGCATTG CGCGCTATAC ATCTCCGTTT
ATTCACCCCA CCGTACCGCA TGCGGTAGAC GATGATACCG TCGCACTTTA TCGTTTCGAT
GAAGGAAGTG GGGTTGTTAT TGGCGATAGC GCTACCGGCG GCCTGAGTGT TGGTGAACTG
AAACCGCGTA CCGGTGGAGC GGCACAACAC TGGTCGAACG ATACTCCATT CACGACGGTG
GTTATAACTA CTGCGACCCA CACCGCCACG CCGGTTCCGT CGCCGACCCC GACCCACACC
GCCACGCCGG TTCCGTCGCC GACCCCGACC CACACCGCCA CGCCGGTCCC GTCGCCGACC
CCGACCCACA CCGCCACGCC GGTTCCGTCG CCGACCCCGA CCCACACCGC CACGCCGGTT
CCGTCGCCAA CCCCGACCGG TACAGCCAAA CCTATGTCGA GTCCTACTAT CGTCAGTGTG
CCACCAACCA CAACACCAAC ATTACCAATT CGGATTTACA TCCCTTTGAT TCTTCAGCCT
CGTCTCGCCC CAAGCACATT ACAATCAGGA GTTGCACCGT ATGACCAGCC CAACCATCAA
TCGAACCGAT CTCATCAATC TTGTTATTAC CAGCCTACGT GA
 
Protein sequence
MPSHNGYNLY QHTALLALVG LLVMALFFRS TRYAAPLQAQ GGSSLRFYGS DTNDRDRVKI 
PLGQIDSAGR LITSHPVNVG DAFTLEFWMK TALGNTAPPC PTGWYTGNII IDRDVFGAGD
YGDYGVAICN QRLVVGISVG SDDRLLIGNT VVTDGLWHHI AIVRANDGKV RLFVDGQLDG
TLNGPVGRID YRQNRSTSYP TSDPYLVLGA EKHDFPGSRY YDGWIDDMRI SRIARYTSPF
IHPTVPHAVD DDTVALYRFD EGSGVVIGDS ATGGLSVGEL KPRTGGAAQH WSNDTPFTTV
VITTATHTAT PVPSPTPTHT ATPVPSPTPT HTATPVPSPT PTHTATPVPS PTPTHTATPV
PSPTPTGTAK PMSSPTIVSV PPTTTPTLPI RIYIPLILQP RLAPSTLQSG VAPYDQPNHQ
SNRSHQSCYY QPT