Gene Cfla_2237 details

Gene Information

Locus tag: Cfla_2237
Symbol:
ID: 9146137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 2497377
End bp: 2498927
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: Curculin domain protein (mannose-binding) lectin
Protein accession: YP_003637327
Protein GI: 296130077
COG category:
COG ID:
TIGRFAM ID:
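
The coordinates and lengths in the table above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the record states neither, and the variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 2497377
end_bp = 2498927
gene_length_bp = 1551
protein_length_aa = 516

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as an amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 1551 0 516
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and gives the listed protein length.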


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.120186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00817186
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGACGTG TGGGGGCCAC GGTCGTGGCA GCCGTGCTGG CGTCGGCCCT CGCGGTGCCC 
GCGGCCGCCG GACCGGGCGA GGACGTGCTG CGCCCCGGCG AGCAGCTCGC GCCGGGCCAG
GCCCTGCTCG CCGCCGGCGG CGGTCACGTG CTCGTCGTGC AGCCGGACGG CGCGCTCGGC
CTGTACGCGG TCACGGGCGA CGTGACCGAC GCGATCGTGC GCTGGTCCTC CGGCCGCGGC
GTGGCCGGCG CGACGCTCGT CGCCGACGCG TCGGGGGACG TACGCCTCGT CGCCCCCGAC
GGCGCCGTGC TGTGGAGCAC CGGCACGGTG GGCTCGGGCG GCGCGCTGCG GTTGCGCGAC
GACGGCGAGG TCGTCGTCGA GGCGGCGGAC GGCACGGCCG TGTGGGGGAG CGGCACGGCG
CTGGCGCCCT CGGTCCTCGT GGGCCCCGGC AGGATCGCGG GGGACGTCGT GCTGTCGTCG
CCGGACGGGC GCCACCTGCT GCACGTCGAC CCGGACTCCG GTGTGCAGCT GCGCGGACCC
GACGGCACCG TGCGGTGGGC GCCGCCGGCC GGCGACGGCG CCGAGGCCGA CCCGGCGGTC
GCCCTCGAGC TGCGTGCGGA CGGCAATCTC GTGGCGGTGG ACGCCGACGG CGACCCCGTG
TGGCGCAGCC GCACCGCGGG GCGCGGGGCG GTGTCGCTCC TGCTCCAGGA CGACGGGAAC
CTCGTGCTGC TCGGTGCCGA CGGTGCGCCC GTGTGGGACG CGGGCAGGCC CATCGGACCC
TCGGGGCTCG ATGCGACGGG TGCGCTCGCG GGCGAGGCGT CGCTCGGCTC GCCGTCCGGG
CACCTCGGTG TGCGCGTGAC GGCGGGTGCG CTGGTCGCGA CGTGGGACGG CGCGCCGATC
TGGTCGTCGT CGACCGTCGG GGGCGTCACC GCGCGCGTGC AGGAGGACGG TGACCTCGTG
CTCCTCGACG CCGCCGGTGC GGCCGTGTGG CGGTCCGGCA CCGCCGGCCG GCCGGGGGCG
CGGCTCGTGG TGGAGGAGAG CGCGGTGCTG CTCGTCGCCC CCAACGGCGA GGTGCTCTGG
CAGGTCGCGG TCCCGGAGGC GCTCGTGCCG ACGGGCGTGC AGCCGACCGA CTGCGACGAC
GTCGACGGGC CGGTCGCCGC CGGCGACGTG GTACGCACCC GCAGCGGCAT CGTCGTCCAC
CCGTGCCTCG CCGAGGCCCT GGACGCCATG GTGGCAGCGG CGCGCGCCGA CGGCATCGAG
CTGCACGGGG GCGGGTGGCG CAGCCCCGAG CAGCAGGTCG CGCTCCGGCG TGCGCACTGC
GGGCCGAGCG ACGCCGACGT GTACGACAAG CCGGCCTCCG CGTGCCGACC GAGCACCGCG
CGGCCCGGCA GCTCGCGGCA CGAGCGCGGC CTCGCGGTCG ACCTCACGTC CGGCGGGCGG
TCCCTGACCG CGGGTTCGGC GGCGTACGCC TGGCTCGTCG AGCACGCGGG CGCCTACGGT
CTGGAGAACC TGCCGGGCGA GCCGTGGCAC TGGAGCGTCG ACGGGTCCTG A
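
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported value is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, as printed.
first_line = "GTGCGACGTG TGGGGGCCAC GGTCGTGGCA GCCGTGCTGG CGTCGGCCCT CGCGGTGCCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing all lines of the gene sequence to `clean_sequence` at once works the same way, since line breaks are treated as whitespace.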
 
Protein sequence
MRRVGATVVA AVLASALAVP AAAGPGEDVL RPGEQLAPGQ ALLAAGGGHV LVVQPDGALG 
LYAVTGDVTD AIVRWSSGRG VAGATLVADA SGDVRLVAPD GAVLWSTGTV GSGGALRLRD
DGEVVVEAAD GTAVWGSGTA LAPSVLVGPG RIAGDVVLSS PDGRHLLHVD PDSGVQLRGP
DGTVRWAPPA GDGAEADPAV ALELRADGNL VAVDADGDPV WRSRTAGRGA VSLLLQDDGN
LVLLGADGAP VWDAGRPIGP SGLDATGALA GEASLGSPSG HLGVRVTAGA LVATWDGAPI
WSSSTVGGVT ARVQEDGDLV LLDAAGAAVW RSGTAGRPGA RLVVEESAVL LVAPNGEVLW
QVAVPEALVP TGVQPTDCDD VDGPVAAGDV VRTRSGIVVH PCLAEALDAM VAAARADGIE
LHGGGWRSPE QQVALRRAHC GPSDADVYDK PASACRPSTA RPGSSRHERG LAVDLTSGGR
SLTAGSAAYA WLVEHAGAYG LENLPGEPWH WSVDGS
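
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 allowing alternative start codons. The following is a minimal sketch of such a translation, not the database's own method. The codon assignments are those of the standard genetic code, which table 11 shares; the start-codon set is an assumption based on the NCBI description of table 11, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: codons accepted as initiators in table 11; all are read as M at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a CDS, reading a start codon as M and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGCGACGTG TGGGGGCCAC GGTCGTGGCA"))  # MRRVGATVVA
```

The printed residues match the first block of the protein sequence above. Codons containing ambiguous bases such as N are not handled and would raise a `KeyError`.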