Gene CHU_0003 details

Gene Information

Locus tag: CHU_0003
Symbol:
ID: 4186884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 1955
End bp: 3355
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 39%
IMG OID: 638070001
Product: TPR repeat-containing protein
Protein accession: YP_676637
Protein GI: 110636430
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
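
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1955, 3355)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record the sketch gives 1401 bp and 466 aa, which agrees with the Gene Length and Protein Length fields.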


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGAC AAAATAAACG ACAATCAGAT GATCGGGAAT TGGCCAGAAA ATTTGAGGAA 
CTTCTGAAAG AAGGAACTCC TGGCTTTTTT GACGTAAGTG TTTACGAAAG CATCATCCAA
TACTATCTAG AAGGAACAAA ATTCAGAAAA GCACACAGAA CATGTGATAT TGCTATGGAG
CAGCATCCGT ATTCTGTAGA AATTATGTTG CTTAAGGTTC AGGTATTAAT GCAGATGACA
CAATTTGAAG AAGCATTAGA GATACTTGAC AGAGCACAGC TATATCAACC CAACGATACC
GACATTCAAT TGCTGCGTGC CAATATCATG GCTCAGCAGG ATGATTTTGA AGGGGCAATT
GAATTGCTTG AAGAGATACT TACGCTTGCC GAAGAGAAAG ATGAAATACA TTACCACATG
GGTGTAATTT ATCAGGATAT GGGTAACTTT GAAGAATCAA TCAATCACCT GAAGGAGGCA
ATTATGCTGA ACTCTCAGCA CGAAGATGCC ATCTATGAAT TGTCCTATAG CCTGGAAGTA
CTTGACCGCC TGGAAGAGAG TATAGACTTC TTTAAACAGT TAATTGAAAA AGATCCGTAT
TCACATTTTG CATGGTTCTG CCTCGGGGTA TCGTATTTCA AGCAAGGTAA GCTGGATGAG
GCACTGGATG CATATGAATT TGTAATAGCG ATTAACGATA AGTACTCTTC CGCGTATTAC
AACATCGGGG AATGTTATGT GTACAAAAAT GAATATGAGA AAGCGCTTGA GTATTTCTTC
CAGACCATGG ATATGGAAGA TAAAACGGCA GATGTTTTTT ACAACATAGG TTTTTGTTAC
GAGCATTTGG GCATGCACCC GAAAGCCATT GAGTTTTACC GCAAAGCATC CAAAGCCGAT
GCGTACTTCC ATGAAGCATA CTATGGAATA GGCAAGTGCC TGGAAGCACA GGATAAATCC
TACGAATCCA TTCATTTCTT CAAAAGAGCG TTAAAGCTGG ATGAAGCCAA TGCTGAATAC
TGGCTTGCAA AAGCGAATGC GGAATATAAA ACGGGCAACA TCATTTCGAG TCTGGAAGCA
TTTGAAGAAG CCTGTGTATT AGAGCCTTCC AATCCGGAAG TATGGAAAAA CTGGTCGTTT
GTACACTATG AAAGTGGTGA TATGGACAAG GCAATCGATT TAATCAATGC CGGGATTGAT
GAAATGCCTG GTAACGCAGA TTTATATTAT CGTGCTGTAG CATACCTGAT TACAGCAGGA
AGGTATAAAG AGGCATTTAA TTATCTGGAA AATGCATTAA CTTTAAACTT CGATAGCCAT
ACGGTGTTGT TTGAATTTTT CCCTAAATTG GAAACTCAAA AGGCATTATT CAGAATTATA
GATCAATACA GAAATAAATA A
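
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first displayed line as sample input, so its result applies to that 60-base sample and not to the 39% reported for the whole gene. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence (sample input only).
FIRST_LINE = "ATGGCCAGAC AAAATAAACG ACAATCAGAT GATCGGGAAT TGGCCAGAAA ATTTGAGGAA"

sample = clean_sequence(FIRST_LINE)
print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full gene sequence through the same two functions would allow a comparison with the GC content field of the record.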
 
Protein sequence
MARQNKRQSD DRELARKFEE LLKEGTPGFF DVSVYESIIQ YYLEGTKFRK AHRTCDIAME 
QHPYSVEIML LKVQVLMQMT QFEEALEILD RAQLYQPNDT DIQLLRANIM AQQDDFEGAI
ELLEEILTLA EEKDEIHYHM GVIYQDMGNF EESINHLKEA IMLNSQHEDA IYELSYSLEV
LDRLEESIDF FKQLIEKDPY SHFAWFCLGV SYFKQGKLDE ALDAYEFVIA INDKYSSAYY
NIGECYVYKN EYEKALEYFF QTMDMEDKTA DVFYNIGFCY EHLGMHPKAI EFYRKASKAD
AYFHEAYYGI GKCLEAQDKS YESIHFFKRA LKLDEANAEY WLAKANAEYK TGNIISSLEA
FEEACVLEPS NPEVWKNWSF VHYESGDMDK AIDLINAGID EMPGNADLYY RAVAYLITAG
RYKEAFNYLE NALTLNFDSH TVLFEFFPKL ETQKALFRII DQYRNK
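
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table by hand and translates the first displayed line of the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which this sketch does not model. The function and constant names are illustrative.

```python
from itertools import product

# Codon order: first base varies slowest, each position cycling T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First displayed line of the gene sequence (sample input only).
FIRST_LINE = "ATGGCCAGAC AAAATAAACG ACAATCAGAT GATCGGGAAT TGGCCAGAAA ATTTGAGGAA"
peptide = translate(FIRST_LINE)
print(peptide)
```

The 60 bases of the first line yield 20 residues, which correspond to the first two blocks of the protein sequence shown above.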