Gene CHU_2139 details


Gene Information

Locus tag: CHU_2139
Symbol:
ID: 4187063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 2483868
End bp: 2485934
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 44%
IMG OID: 638072139
Product: CHU large protein, SAP or adhesin AidA-related
Protein accession: YP_678744
Protein GI: 110638535
COG category:
COG ID:
TIGRFAM ID:
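
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2483868
END_BP = 2485934
GENE_LENGTH_BP = 2067
PROTEIN_LENGTH_AA = 688


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2485934 - 2483868 + 1 gives 2067 bp, and 2067 / 3 - 1 gives 688 aa, which agrees with the listed lengths.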


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.534536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAT TCTTAGGCTT ATCTGTTCTT ATATATATGC TGCTTTCGGG TATGATGGCA 
GCCGCACAAT GTCCGTCAGA TCCTACTGCA TCCTGCACGA GCACGATAAG CGCATCTTCT
GTTACAAACG TAACGGTTGC TGCGGGGCAG GTTGTGTGTG TTACCGGCGG CATTCATACC
GGTACATTTA ACGTTCAGAA CGGCGGTATC CTATTGGTGT CCGGTGGTAC AATCGGCCCA
CAGGTTGCTG TACAAAATGG CGGTAACCTT GTTGTTTCAG GAGGTATATT AACAGGGAGC
GCCAGTTTGC CTGCGGGGGC AAGTATGTTC ATAAAAAATA ATCCGGCATT AAACGGTACA
CTTGCTATGT CTGGCGGTAC CTTAAACGTA TTAAGCGGCG GTACACTTGC TAAAGACATT
AGTGCTACGG CTGCAAGTAC CATAAATAAC TGCGGTACAC TTTCCGGCAC ACGTAACTTC
AATACAAATA TCACACTAAA CAATTATTCA GCCAGCACTA TAACATTAAG TACCGGCGGC
AATACATTGA ATAATTATGC CGATAACGTG ACTATTGCTT ATAACAACGT AAATACAACA
AATACATTCA ACAACTACGG TACAGGCGTG AACTTCAGTA TAACAGGCGC ATGGAACAGC
GGGATGACCT TTAACAATGC TGCCGGTGCA GCCTTAACGG TTACTTCCGT ACCAGGTGGA
GCGATGCCTT CTGCTACGGT GTTCAATAAT GCAGGAACAC TTACCTATTC GCCGGTTTTA
AATACAAATG GCGCAACCTT CAACAATGCT GCAACAGGTA CATTTAATCT CTCAAGTTCC
GCATCTGCAA ACAGGCCTAA TATTACCAAT GATGGTACAA TGAATGTTTC AGGTACCTTT
TATTTAGCCG GCAATACAAC AACCAACAAT GGCACCATGA CCTTTTCTGA TGAGCTGCGT
CTGGACGGCG GCACATTGAA TCTTGGTGCT AATTCAACTA CCACAACAAA AACGTTATAT
AAAAACAACG GCAGCATTAA CATGTATGAC CATAGTGTTT TAAATATTGT ACAAAATGTT
ACTACATGGA ACGGCACTGC TATTCACCTG GTATCGGGTT GTGCTTCGGT ATTGGGAAGT
ACTACGCCAA GTACAACAAA TATAAACGCT ACTTTCCTGG ATAATGCAAA TATCAATTTC
TGCGGTGCTC CTCCTGCGCA GTCGCCATCA TATATTGCAA TCACGTCGGT AACCAATAGT
GCATCCAGTC CGGGCAGATA CAGAATTGCC TTATCCTCCG GGCCTGCAAC AAACGGTTAT
GTTCAGATCT CCGGTGTAAC AGGTGTTTCA GATTTGAATG GTTACTGGCA GGTGATCAAT
AACGGAAACG GTACATGGGA CCTTATCGGC AGCACATACA CAGCAGGCGC AGTGATCAGC
GGCAGCCAGG TAATCGTGGA TCAAACCAAA TTGAAATTAG GTCCAGGCTA TTACTTAGGC
TATTCCGGCT GCAGCAATCC ATGTGCTCCA CTGCCGATTA CGTTACTCTC ATTTACCGCA
GAGAAAGAAG ACGCGCATGT GGTTATTGAA TGGGTAACAC TGCAGGAAAA AAATAATCAG
TCGTATACAG TAGAAAGATC TGCAGACGGC ATTCATTTTG AATCTGTTGT TTCGCTTGCA
GGAAATAAAA ACAGTTCACA AAAAATGACC TACACGCAGT ATGATTTCAG CCCGTTACCA
GGCGTAACGT ACTACAGATT AAAACAGACA GACATGGATC AAACACATTC GTATTCAAGT
GTTGTTGCAG TAGATTCCAA TTCAGAAATA GATTGGACCA TCTACCCGAA CCCAAGTACA
ACGGGCGATT TCACGATCCT TTCTGCCTTT GCCGATAATG AAATTGTTGC TGTTACCGTT
ACAGACATGA CTGGCAATAC GGTTAGATCA TATGATGAAA GCTCGTACGA ACAGCAAATG
GAGATCAGCA ACCTGGGTAT GGGTTTGTAT GTGGTGAGTA TTCAAACAGT AACAGGCCAG
AAATCTAAAA AAGTAATTGT TCAGTAA
 
Protein sequence
MKRFLGLSVL IYMLLSGMMA AAQCPSDPTA SCTSTISASS VTNVTVAAGQ VVCVTGGIHT 
GTFNVQNGGI LLVSGGTIGP QVAVQNGGNL VVSGGILTGS ASLPAGASMF IKNNPALNGT
LAMSGGTLNV LSGGTLAKDI SATAASTINN CGTLSGTRNF NTNITLNNYS ASTITLSTGG
NTLNNYADNV TIAYNNVNTT NTFNNYGTGV NFSITGAWNS GMTFNNAAGA ALTVTSVPGG
AMPSATVFNN AGTLTYSPVL NTNGATFNNA ATGTFNLSSS ASANRPNITN DGTMNVSGTF
YLAGNTTTNN GTMTFSDELR LDGGTLNLGA NSTTTTKTLY KNNGSINMYD HSVLNIVQNV
TTWNGTAIHL VSGCASVLGS TTPSTTNINA TFLDNANINF CGAPPAQSPS YIAITSVTNS
ASSPGRYRIA LSSGPATNGY VQISGVTGVS DLNGYWQVIN NGNGTWDLIG STYTAGAVIS
GSQVIVDQTK LKLGPGYYLG YSGCSNPCAP LPITLLSFTA EKEDAHVVIE WVTLQEKNNQ
SYTVERSADG IHFESVVSLA GNKNSSQKMT YTQYDFSPLP GVTYYRLKQT DMDQTHSYSS
VVAVDSNSEI DWTIYPNPST TGDFTILSAF ADNEIVAVTV TDMTGNTVRS YDESSYEQQM
EISNLGMGLY VVSIQTVTGQ KSKKVIVQ
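
The sequences above are printed in space-separated blocks of 10 characters. Here is a minimal sketch of how such text could be collapsed into a contiguous string before checking its length or GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tooling.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a nucleotide string that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAACGAT TCTTAGGCTT ATCTGTTCTT ATATATATGC TGCTTTCGGG TATGATGGCA "
first_bases = clean_sequence(first_line)

if __name__ == "__main__":
    print(len(first_bases))          # number of bases on that line
    print(first_bases[:3])           # the first codon
    print(gc_fraction(first_bases))  # GC fraction of that line only
```

Applied to the full gene or protein listing, the cleaned length can be compared with the Gene Length and Protein Length fields. The GC fraction of the full gene sequence can be compared with the listed GC content.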