Gene CHU_2103 details

Gene Information

Locus tag: CHU_2103
Symbol: cel
ID: 4186309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 2448305
End bp: 2449345
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 43%
IMG OID: 638072103
Product: endoglucanase
Protein accession: YP_678708
Protein GI: 110638499
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
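
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2448305, 2449345

gene_length = span_length(start_bp, end_bp)
print(gene_length)                           # 1041, matches "Gene Length"
print(expected_protein_length(gene_length))  # 346, matches "Protein Length"
```

Because the record leaves the Strand field empty, this check says nothing about orientation; it only confirms that the listed lengths agree with the coordinates.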


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.438737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAAA AAATATCCGT AGTGCTTGTC CTTCTTACAG GCATGTTACT TTCCGCTTCG 
GTTTTTGCAC AAAAGACAAT CGTTGAAAAA TACGGTAAGT TGTCTGTAAA AGGAAATTAT
ATGGTTGGCC AGTACGGTGA TACCGTTCAG CTGAGAGGCA TGTCTTTATT CTGGAGTCAG
TGGATGGGGC AATACTACAA TTCAGATGTG GTAAAGTGGC TGCGCGACGA TTGGAAATGT
ACCGTAGTAC GTGCTGCAAT GGGCGTGGAA ATGGACGGAT ACCTTGAAAA TCCGGATACA
GAAAAAATGA AGGTGATGGA AGTGGTGAAT GCTGCTATTG CCAAAGGCAT TTATGTGATC
ATTGATTACC ACAGCCACGA AGCGCAGAAG AATCCTGCAG CGGCGCAACG GTTCTTTTCT
GAGATGGCAA AAAAATACGG GAACATTCCC AATATTATTT ATGAAGTTTA TAATGAACCA
CTGCAGGCAA CTTCCTGGAA TAAGGACATA AAGCCGTATG CAGAAGGTGT CATTACAAAA
ATACGTGTGT ATGATACAAC AAACATTATT GTGGTAGGAA CAAGACAATG GTCGCAGCTG
GTAACAGAGG CGGCAGCGAA TCCGATCACC CGTCAGAACA TCATGTATAC CCTTCATTTT
TATCCGGGTA CGCACAAGCA GGAATTGCGT AATGAAGCAC AAAAAGCATT GGATATGGGT
ATTGCCTTAT TTGTTACTGA ATATGGTACC TGCGATGCAT CGGGTAACGG AAATTTCAGT
CCGGAAGAAA CTGCTTTGTG GTATGAATTT CTGGATGCCC ACAAGATCAG TTATTGCAAC
TGGTCCATTG CGGATAAGCC CGAAACCGCT TCAGCTATTG TACCGGCAGC AAGTCCGTAT
GGTGGCTGGG CTGATTATGA TCTTACACCG TCGGGCAAAT TAGTACGCGA TGATCTGCGC
TTAAAAAATG GACCTATCTT TGACTCACTG GTAAAGACCA GTACTGGCGG AGTGTCTAAA
AAGAAATCAA AAACAAAATA G
 
Protein sequence
MIKKISVVLV LLTGMLLSAS VFAQKTIVEK YGKLSVKGNY MVGQYGDTVQ LRGMSLFWSQ 
WMGQYYNSDV VKWLRDDWKC TVVRAAMGVE MDGYLENPDT EKMKVMEVVN AAIAKGIYVI
IDYHSHEAQK NPAAAQRFFS EMAKKYGNIP NIIYEVYNEP LQATSWNKDI KPYAEGVITK
IRVYDTTNII VVGTRQWSQL VTEAAANPIT RQNIMYTLHF YPGTHKQELR NEAQKALDMG
IALFVTEYGT CDASGNGNFS PEETALWYEF LDAHKISYCN WSIADKPETA SAIVPAASPY
GGWADYDLTP SGKLVRDDLR LKNGPIFDSL VKTSTGGVSK KKSKTK
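
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be cleaned and checked follows. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Strip the spaces and line breaks used to group the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence shown above.
print(translate(clean("ATGATTAAAA AAATATCCGT AGTGCTTGTC")))  # MIKKISVVLV
print(translate(clean("AAGAAATCAA AAACAAAATA G")))           # KKSKTK
```

The two excerpts translate to the first ten and last six residues of the protein sequence listed above, with the final TAG codon acting as the stop. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.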