Gene CHU_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1944 
SymbolyliI 
ID4186433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2274226 
End bp2275527 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content43% 
IMG OID638071946 
ProductL-sorbosone dehydrogenase 
Protein accessionYP_678552 
Protein GI110638343 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.358319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0404039 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAT TTTTATTTAT CAGTATGTGC ATGGGAGTAT TAGTTTCATG TGCTGAAAAA 
AAAGAACAGG CAGCTGTATT GGATGATACC ATTGCTACGG AAGTTACAGA TACTTTTCCG
AAGCCATTTG AAACAGAATC TGTCCGTAAA TTCAGTAAAG CACTTGGTTG GAAAGAAGGG
CAGGCACCAA CAGCACCGGC AGGTTTTGTG GTACAAAAAT ATGCGGATAA ACTGGAGAAC
CCCAGATGGC TGTATGTACT GCCGAATGGC GATGTGTTAG TAGCAGAAGC AGGAACAAAA
GGAGCATTGA AAAATGCAGC TTCTTTTATC AGCGGGAATT CGAAATCAAA ATTATCTGAT
GGAAGTGCAG ACCGCATTAC ATTATTCCGG GATACAGACA AAGATGGTAT GCCAGATGTG
CGGGAAATTT TTCTTGAAAA ACTGAACCAG CCTTTGGGAA TGTTGCTGAT TGAAAACACA
TTTTACGTAG CCAATACAGA CGGTGTAGTT TCGTTTCCAT ATAAAAAAGG GCAGACTTCG
ATCACGGCAA AAGCGAAACA GATTACAGAC TTGCCCGCTG GCGGGTACAA CAATCATTGG
ACGCGTAACC TGCTTGCCAA TAAAGATGAA TCTAAAATAT ATATTACCGT TGGTTCTTCC
AGCAATGTTG CAGAACATGG CATGGATGAA GAGCTTATGC GTGCAAATGT ACTTGTTATG
AATCCGGATG GAAGCGATAA AAAGGTATAT GCTTCGGGTT TACGTAATCC GGTAGGGGCA
GCCTGGGCGC CGGATACCAA AACATTCTGG ACTGTTGTAA ACGAACGTGA TGGTTTGGGT
GATGAACTTG TTCCGGATTA TTTAACAAGC GTACAGGAAG GTGGATTTTA TGGCTGGCCA
TATTCGTATT GGGGACAGCA TGCAGACCCG AGACTGGAAG GAAAAGGAAT GGATCTGGTA
AAGAAAGCAA TAGTGCCGGA TGTTGCATTG GGCGCACATA CCGCTTCGCT GGGCTTAGCC
TTTTATGATC AGAAAGCTTT TCCTGAAAAA TATCACAACG GTGCCTTTAT CGGGCAGCAT
GGTTCGTGGA ACCGTTCGGT ATTTTCAGGA TATAAAGTTG TATTTGTTCC CTTCAAAAAC
GGCAAGCCTT TAGGTGCTCC TGAAGACTTT TTAACAGGCT TTATTGCTAA TGCAGATGAA
GTATATGGCA GACCTGTTGG CATTACGGTA TTACCTGACG GTTCTATCCT GGTAGCAGAT
GATGCAGCTA ACACGATATG GAAAATCAGC GTTGCGAAAT AA
 
Protein sequence
MKQFLFISMC MGVLVSCAEK KEQAAVLDDT IATEVTDTFP KPFETESVRK FSKALGWKEG 
QAPTAPAGFV VQKYADKLEN PRWLYVLPNG DVLVAEAGTK GALKNAASFI SGNSKSKLSD
GSADRITLFR DTDKDGMPDV REIFLEKLNQ PLGMLLIENT FYVANTDGVV SFPYKKGQTS
ITAKAKQITD LPAGGYNNHW TRNLLANKDE SKIYITVGSS SNVAEHGMDE ELMRANVLVM
NPDGSDKKVY ASGLRNPVGA AWAPDTKTFW TVVNERDGLG DELVPDYLTS VQEGGFYGWP
YSYWGQHADP RLEGKGMDLV KKAIVPDVAL GAHTASLGLA FYDQKAFPEK YHNGAFIGQH
GSWNRSVFSG YKVVFVPFKN GKPLGAPEDF LTGFIANADE VYGRPVGITV LPDGSILVAD
DAANTIWKIS VAK