Gene CHU_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_1854 
Symbol 
ID4185924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2171808 
End bp2174987 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content43% 
IMG OID638071852 
Producthypothetical protein 
Protein accessionYP_678462 
Protein GI110638253 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00800733 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATCCC TGTTTGTATC CATTACCTTT TTTTGTTTTA GTATACTGAA TCTTTCCGCG 
CAAACCATTA CTTTTCCGGA TGCTAATTTT CAAAACGCAT TGCTTACACA TTGGCCGGCT
ATTGATATAA ACAACGACGG ACTTATTCAG ATAAGTGAAA TAGAAAATCT AACAAGTTTA
TATGTTGCAG GTAAATCAAT TTCAGATTTA AGCGGCCTTG AGGCATTTCC CAAATTAGTA
AGTTTAAACT GCAGCAATAA TAGTTTATCA CACATAGATC TGTCTCACAA TCCGGAGTTG
AAATTTTTAG AACTCGGCTG GAATAGCATT TCGAATATTG ATCTCAGCAA AAGTACGAAG
CTGCAGGTCT TAGGGCTTCA GGATAATGGC TTGACCTCTA TTGACGTAAC CAGTAATAAA
GATCTGCGGG AATTAAAAGT AGAATATAAT GCATTAACGC AGCTGGATGT ATCAGAAAAC
AGATTCATCT GGTATCTGAA TTTTTCTGAC AATCAAATAA GTACGATCAA TTTAAATCCT
ATCCGTTCAT TAAGTGTACT TTCTGCCGGA AATAATTTGC TTACTTCACT CAACTTAAGC
TGTCACACCA ATTTGTCTTA CGTTACAATT GACAATAACT CATTATTAAA TTCAGTATGC
CTGAATAGCT CGGATTATGC AGAACGCATG TCAACTACAG ACCCAAACAG TGCGTATTGG
ACGAAAAGTT CAAACACTGC CTGGGCAAAT TGTCCTTCGC TTCCGCTTGT TACACCTATA
CCGGCAACAA TTGTACCTAT TAACCTGTGT GAAACAGGAG ATATGACATT CAATGCCGTT
TCAGAACAGT TTATCTCTAA CCCAACCTAT CGCTGGCAAA AAGCAAACTC TGCTGCAGGG
CCATGGGTAA GTATTCCCGG CGCAATCAAT CAATCTCTCG TGCTTACAAA CCTGCAGATG
AGCGATAATG GTTCGATATA TAAAGTGCTG ATCAGCTCAA CGGATTATTG CGGAACAGGA
CATATGGAAG AGGCAGCCGG ACAAATCATA TTACATCCGG CGGTTTATCC AAAGGTTACA
GCAAGTGCTT CTGTAAGCGG CGCCATATGC GATAATGCAT TGCCAATCAC ATATACAGCA
ACGCCTGTAG CCGGACAAGG CTCTGCTCCT GCGTATCAAT GGTACTATTA TAACAATCCC
GGTTTCACAG CTATAGCAGG TGCTACCGAT ATTCAGTATA CACCTGCAGT GCTTCCGCCA
GATGGCACAC AATTATTTGC AGGTATGTTT ACAGATGAAA TGTGTGCGGT TGAATCCGGA
GTTGCATCCA ATATCCTTAC TGTTGCTGTT GTACCAGCTC CCGAACCAGT GATAACCACA
AGCGACATCG CTCTATGCAA TCCTTCCGGA TATGTCATAC AGACAAGCCA CACTGCGGCA
ACAGGCACAC AGTTTCAATG GTACAGAAAC GGTATTGCCC TTGCTGGAAG TAATGTACGG
AATCTTACCA TTCCTGGTGC CGGCACCTAT TATATGACTG AAGATAATGG CACCTGCACT
ACAGCTTCCG GTACTGTAAC AATTTCTACA TCATCTACTA CGCCAGATCC GCAGGTGAAC
GTCAGTTCTT CCCTTACGGG CGCAGCCTGC GACACGCTTC AAAAAATTAC ATATACTGCC
ACAGCAGTTG CCGGAACGGT TACAGCCCCA ACCTATCAAT GGTACAATGC TGTTACAGAT
ACACCTATCA GCGGCGCAAC CGATGCAGTA TACACACCTG CAGCCAAACC TATGAATGGA
GACCAGGTCT ATGTACAGAT ACACACAAAC GAAGCCTGTG TAGCAAATCC GGATGTAAAA
TCCATTACGT TAACTACTCA AATACTTACT ACACCTGCCC CGCTTATTAC ATCCGGCAAT
ACAGCATTGT GCAATCCATC CGGATATGTC ATACAGACAA GCCAGACCGC AGCAACAGGT
ACGCAGTTTC AATGGTTTAG AAATGGCAGT GCATTAACAG GAAGTAATAC ACCTACGCTT
ACCATTGCAT CTGAAGGCAC ATATTATCTG GTTGAAGATA ATGGTACCTG TAAAATATCC
TCAGGCAGTG TAACAATTTC TGCTATAGCC GCTATGCCGG ATCCACAGGT AAACATAAGT
TCTTCACTTA CCGGTACTGC ATGTGATACA CTTCAAAAAA TAACATATAC AGCCACACCA
GTTGCCGGCA CGGTTACAAC TCCAACCTAT CAGTGGTATA ATGCTGCAAC AGATACCCCG
ATCAGTGGAG CAACCGATGC GGTATATACG CCTGCTGCGA AACCTGTAAA CGGTGATAAG
GTATATGTAC AGATGCATTC AGCTGCAACA TGTGTAGCAC ATCCGGATGT AACGTCTGCG
ACCCTAACCG CTCACATCCT TACAACGCCT GCACCTGTTA TGCTGTCAAG GGACACAGCT
ATATGCATGC CGCAGGAATA TAAACTTTCA GCTAAACTTG CTTCGGGAAC TCAGATACAA
TGGTACAAAA ACGGCACACC AATAACGGGT AGCAGCCATG CTGCTTATAC GATTCATGCA
GATGAATTAT CGGGCGGTTC GTATCATGTA TCTGAAAGCA ATGGCGCCTG TACCATTCAT
GCAGATGCTG TAAACATTGA ACTGATGCAT ACACCATTAT TATATACAGA AAATGAGTTA
TATGTTTCCA AAGGAGAACG TGTTACATTA AATGTACAGG CAGAGCACGC AGCATATCAT
CATTGGAGTC CGGCTATTGG ATTGAGTAAT CCGGATCAGC TGATAACAGA ACATCTTGCA
ACCAACTCTG TTACTTACAC ACTTCATGCA ACAAATCAGT TAAATAAATG TCCGGTAAGT
ACTGAAGTAA CGATACGTGT GAAAGCAGAG GTAATTATAC CCAATGTAAT TACAGTGAAT
GGCGATGGTG TAAACGATAC CTGGGAAATT GAAAACATAG AAAACTTTCC GCAGGCCTCC
ATTGAAATAT TCAACCGCTG GGGAAATATG GTCTGGAAAA GTATCGGTTA CAGCAATCAA
TGGAACGGCA GCAGCAGAAA TGGTGAGCCA CTACCCGCAG GCACATATTA TTATATCGTT
AATCTCAACA GCAAACAACA AACGGATGCG TATACAGGCT ATATTCAACT AGTAGAATAA
 
Protein sequence
MKSLFVSITF FCFSILNLSA QTITFPDANF QNALLTHWPA IDINNDGLIQ ISEIENLTSL 
YVAGKSISDL SGLEAFPKLV SLNCSNNSLS HIDLSHNPEL KFLELGWNSI SNIDLSKSTK
LQVLGLQDNG LTSIDVTSNK DLRELKVEYN ALTQLDVSEN RFIWYLNFSD NQISTINLNP
IRSLSVLSAG NNLLTSLNLS CHTNLSYVTI DNNSLLNSVC LNSSDYAERM STTDPNSAYW
TKSSNTAWAN CPSLPLVTPI PATIVPINLC ETGDMTFNAV SEQFISNPTY RWQKANSAAG
PWVSIPGAIN QSLVLTNLQM SDNGSIYKVL ISSTDYCGTG HMEEAAGQII LHPAVYPKVT
ASASVSGAIC DNALPITYTA TPVAGQGSAP AYQWYYYNNP GFTAIAGATD IQYTPAVLPP
DGTQLFAGMF TDEMCAVESG VASNILTVAV VPAPEPVITT SDIALCNPSG YVIQTSHTAA
TGTQFQWYRN GIALAGSNVR NLTIPGAGTY YMTEDNGTCT TASGTVTIST SSTTPDPQVN
VSSSLTGAAC DTLQKITYTA TAVAGTVTAP TYQWYNAVTD TPISGATDAV YTPAAKPMNG
DQVYVQIHTN EACVANPDVK SITLTTQILT TPAPLITSGN TALCNPSGYV IQTSQTAATG
TQFQWFRNGS ALTGSNTPTL TIASEGTYYL VEDNGTCKIS SGSVTISAIA AMPDPQVNIS
SSLTGTACDT LQKITYTATP VAGTVTTPTY QWYNAATDTP ISGATDAVYT PAAKPVNGDK
VYVQMHSAAT CVAHPDVTSA TLTAHILTTP APVMLSRDTA ICMPQEYKLS AKLASGTQIQ
WYKNGTPITG SSHAAYTIHA DELSGGSYHV SESNGACTIH ADAVNIELMH TPLLYTENEL
YVSKGERVTL NVQAEHAAYH HWSPAIGLSN PDQLITEHLA TNSVTYTLHA TNQLNKCPVS
TEVTIRVKAE VIIPNVITVN GDGVNDTWEI ENIENFPQAS IEIFNRWGNM VWKSIGYSNQ
WNGSSRNGEP LPAGTYYYIV NLNSKQQTDA YTGYIQLVE