Gene CHU_0873 details


Gene Information

Locus tag: CHU_0873
Symbol:
ID: 4185227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 997088
End bp: 1000249
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 43%
IMG OID: 638070875
Product: CHU large protein
Protein accession: YP_677496
Protein GI: 110637289
COG category:
COG ID:
TIGRFAM ID:
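
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 997088
end_bp = 1000249
gene_length_bp = 3162
protein_length_aa = 1053

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which does not contribute a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", expected_protein_aa == protein_length_aa)
```

Under those assumptions, all three checks agree with the listed values.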


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.539179
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATT TGTACTCAGC AAAAAAACTA TTGTTATTAT TTTTACTTAC CTCCATCTGC 
TCATTAGGTT ATGGACAGAC AATAGGTACG TACAATTTTA ATTCAGGAAC ATGTTCTACA
CAAACAGGCA CCGCGTCAGT TTCAAATGTA ACATTGAATG CTACTACTAC AGGTGCAGGT
CTTACCTGCA CAATTGGATC AGGAGCAATT ACATTAACGT CCACCTCTAA CTGGCCAGCG
TCGTTAACCT ATCCTGCAAA CAGTAATGCT TACTTAGAAT TTTCAGTAAC ACCTGCCGTT
GGATATGAAG TAAATATCAG TCAGGTCATT GTAAAAGCAG CCAGGGGAAA TGGTGGCGCA
AAAAATCTTA CTGTAGCATA TGACAACGGA TCCGGATATT CTACTGCGAC AAGTGCGTCT
ATAGCTCCTG CAACAGTAAC TACTTCATCA CTTGCTTTTA CGCTTGATAT TCCGGATGTC
TCGTCTACCT CCACTGTTAC ATTCCGTTTG TATGGATATA CAGGTGCAGT AACTTCTCCG
AAATCCTTAA TCACAGATTA CATTCAAATC GACGGAAATG TTGCCATTGC CAGTCCAACG
ATGCAATCTT CTATAGCTGT TACTTCAGCT ACAATCAATT CTGCAACACT TAGCTTTACG
GGCGGAAATG GTTCTCAACG TTTAGTACTT GCTCAGGAAG CAAGCCCGAT CAGCGCTGCA
CCGACTGATC TTACCTCCTA CAATGCCTCA AGTGTTTTCG GAGCATCCTC AACTCCTATT
GGAGGTGCAG CTTTCCCTGT TTACATCGGT TCTGGAAATG CGGTAACAGT TACCGGCTTA
AATCCATCTA CAACGTATTA TTTCTCCGTG TTTGAATTCA ACAAACTGGC TGGTACAAAT
ACTGAAAACT ATTTACTGCC TGGCGGTTCA ACATCCGTCA AAACCAATAA AGGTATTTAT
ACCTGGTCAG CAGGAAGTTC CGGATCATGG GCAAATCCTG CAAGCTGGAC ACCTAACAGA
AATTCACCGG CCTTTGACGG TTCGGATTCG TTGATCTTTA ACAGCGGCGG TACTATTACG
GTGACAAATG TTACTGATCA GGATGTATTT AGCGGACTTT CTATTTTTAA TAATACACAT
TTAATATTAA GTGCATCTGC ACCAACAACT TTATATGCTA TAGATTTTTC CGGATATGAA
ATCGCTATTG AATCCGGCTC TACTCTTGAG ATCAACAGCA CAAACGATTT CTTCCTATAT
CTGGATTATA ATTCCAGCCT TACTTCAGAA GGCACATTAA TTCTTGCTAA AACAAACAAT
ACCATCATCG GTACAGGAGA TATAACAATC AATGGAACAA TTTCGCTCGT ACATCCTGAT
GGACTTTACG GTACATCAGG ATCCAATGCC ATTGATGCAG GAGCAACATC CCTTACGCTG
GGCGCTGCAA GTACTGTAAA CTATGCCGGT GCAGCACAGA CAATCACTAC TGCCAATCCT
ACCCCATATG CAAACCTTAC ATTAAGCGGT TCTGGCACTA AAACACCTGA TGGCAACCTG
GATGTACACA ACCTTACCCT GAGCGGTACT TCGGTATTAG CACTTGCTGC TTATGAGTTA
AGTGTAGCCG GTAACTGGAC CAGCTACTCT GCTGCAGCAT TTACGGAAAC AGGCAAAACC
GTAACATTCA ACGGTACCGC AGCACAGACT CTTTCAACTA CAGGAGGTGA AATATTTGAA
GGGCTAAATC TCAACAACTC CTCTCTTACA TTATCTTCTC CGCTGCGCGT AAACGGACTG
GTAACACTTA CCGCCGGTAC ACTTACCGCC GGCGGAAATC TTACCCTGAA CCTGGCAACA
GCTAAAATCG ACGCTGCTGG TTCTGGTTCG ATCAGCGGAA ACATGAAAGT GATCCGTAAC
TTTGTACCGG CCAAAACGCA TTATATTGCC TCTCCCTTAG CTGGTGTTAC CGCTGCTGAC
ATCGCCGATG AATATCCTGT AGTATCAGGA AGCCAATCCC GTTTACTTGA TCTGGACTGC
TCTACAAACA AATTCTTTGG TATTTATGAC ATGAGTACTC CGCTGGCACA GGGGGATGCG
CTTTCTATGT TCTTCCCTGC TTCAGGAACG CCTGCAAACG GAGTGAACAC CGTTACATTC
ACGGGTACTT ATAATCATGC TGCTGCAAGC TATTCTTCTT CTTGCCCGGT AAATACTTCT
GTAAAAGACT TCTTTGCAGG TAACCCTTAT CCGTCTAACT TATTCTTAGG TGGAATTACC
GGATCAGGTA CTTCAGGTAA TTACTATGTG TTTAACAACA ATACATTCAA TGTATATTCT
TCTGTAACAG GAATCGGTAC TGGTGTAATT GTGAATGGAT ATGTGGCACC CATGCAGGGC
TTCCAGGTGG AAACTGACGG AACGGGCGGA ACCGCTTCGG TAACAATCCC TTCAAGTGCC
AGAGATGTTG CACAGACTAC TTCCTATGCC CGTACCGCTG TGGCTGATAA CATCATCCGC
TTACAGGCGT CAAACGGTAC ATTTACCGAT GAAACCGTTA TCCTGTTTAC TGATGCTGCT
ACAAATGGTT TTGATCTGGG ATATGATGCC GGTAAAATGC GCAATACTGG AACGAACACT
CCTAACCTGT ATACATTAGC CGATACAACA AAACTGATCA TTAACAGCCT GGCGCCCCTT
ACAAATACCG TTGATATTCC CTTGAACATC ACTACTACCA CTACTTCTAA CTATACCCTG
TCTTTCACAA ACAAAGACGC TTTTGATACT TTCAAATCGG TATACCTGAT CTTCCCGGAT
GGAGCACTGC ACAACTTAAC ACAGAACCCT GCCGTAACCG TGGCAACAAA CCCGGCAACA
CCATACATAC TACGTGTAGG CACTGAAAAC ATTACTACTT CAGCTTCTAA AGCAAAAACA
AATAATGCTT TAAAAGCTTA CATGAATGAA GATATGCTGG TTGTTACAAT GAACAGCAAC
ATTCAGCAGG TGGAAGTGTA TGACCTAACA GGAAACAGAA TCGCGGAAGG AACAACGGAT
GCGGGTTCCT TCACTACTAA ATTATCTGCT AAAGCGGGTA TTTATGTGGT AAAGGTACTT
TCTGAAAATA ACCTGTACAC AACTAAAATA GCAGTTAAAT AA
 
Protein sequence
MKYLYSAKKL LLLFLLTSIC SLGYGQTIGT YNFNSGTCST QTGTASVSNV TLNATTTGAG 
LTCTIGSGAI TLTSTSNWPA SLTYPANSNA YLEFSVTPAV GYEVNISQVI VKAARGNGGA
KNLTVAYDNG SGYSTATSAS IAPATVTTSS LAFTLDIPDV SSTSTVTFRL YGYTGAVTSP
KSLITDYIQI DGNVAIASPT MQSSIAVTSA TINSATLSFT GGNGSQRLVL AQEASPISAA
PTDLTSYNAS SVFGASSTPI GGAAFPVYIG SGNAVTVTGL NPSTTYYFSV FEFNKLAGTN
TENYLLPGGS TSVKTNKGIY TWSAGSSGSW ANPASWTPNR NSPAFDGSDS LIFNSGGTIT
VTNVTDQDVF SGLSIFNNTH LILSASAPTT LYAIDFSGYE IAIESGSTLE INSTNDFFLY
LDYNSSLTSE GTLILAKTNN TIIGTGDITI NGTISLVHPD GLYGTSGSNA IDAGATSLTL
GAASTVNYAG AAQTITTANP TPYANLTLSG SGTKTPDGNL DVHNLTLSGT SVLALAAYEL
SVAGNWTSYS AAAFTETGKT VTFNGTAAQT LSTTGGEIFE GLNLNNSSLT LSSPLRVNGL
VTLTAGTLTA GGNLTLNLAT AKIDAAGSGS ISGNMKVIRN FVPAKTHYIA SPLAGVTAAD
IADEYPVVSG SQSRLLDLDC STNKFFGIYD MSTPLAQGDA LSMFFPASGT PANGVNTVTF
TGTYNHAAAS YSSSCPVNTS VKDFFAGNPY PSNLFLGGIT GSGTSGNYYV FNNNTFNVYS
SVTGIGTGVI VNGYVAPMQG FQVETDGTGG TASVTIPSSA RDVAQTTSYA RTAVADNIIR
LQASNGTFTD ETVILFTDAA TNGFDLGYDA GKMRNTGTNT PNLYTLADTT KLIINSLAPL
TNTVDIPLNI TTTTTSNYTL SFTNKDAFDT FKSVYLIFPD GALHNLTQNP AVTVATNPAT
PYILRVGTEN ITTSASKAKT NNALKAYMNE DMLVVTMNSN IQQVEVYDLT GNRIAEGTTD
AGSFTTKLSA KAGIYVVKVL SENNLYTTKI AVK
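
The relationship between the two sequences can be spot-checked by translating the first and last lines of the gene sequence and comparing them with the ends of the protein sequence. This is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in how codons are decoded). The names `translate`, `first_60`, and `last_42` are hypothetical helpers for this example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the gene sequence, copied from the record.
first_60 = "ATGAAATATT TGTACTCAGC AAAAAAACTA TTGTTATTAT TTTTACTTAC CTCCATCTGC"
last_42 = "TCTGAAAATA ACCTGTACAC AACTAAAATA GCAGTTAAAT AA"

print(translate(first_60))  # MKYLYSAKKLLLLFLLTSIC
print(translate(last_42))   # SENNLYTTKIAVK*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final 13 residues followed by the TAA stop codon.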