Gene CHU_3781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_3781 
Symbol 
ID4183803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp4373383 
End bp4374675 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content46% 
IMG OID638073767 
Producthypothetical protein 
Protein accessionYP_680356 
Protein GI110640146 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.42028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGTAG CAAGCGTTGT GTTATGGGGC AAACATGTTG GTGCGGTGTT ATGGAACTAC 
GACAAGGGTT ATGCAACGTT TGAATTTGAA GCAGGCTTTT TAAAGTCAGG TCTGGATGTT
GCACCTCTAA CAATGCCGTT AGCAAATGTC AGCAAAGGAG AAATTTTTCA GTTCGCGCAA
TTACCGAAAG AGACATTTCA TGGGTTGCCT GGCCTATTGT CAGATGCATT GCCTGATCGT
TTCGGGAATC AGCTGATTGA TTTGTGGCTG GCATCGCAGG GGCGTGATAA AGCAAGTATG
AGTCCTGTCG AACGGCTGTG TTACCTGGGG ACGCGTGGCA TGGGCGCGCT GGAATTTGAA
CCGACCGTAC GCAACGAAAA AGAACCTTCA AAAGGATTAG AAATCGGCGC GCTGGTAGAG
TTATCAAAAA AAGCGCTGTC GATGAAAGCT TCCTTAGACA GTCACTTTTC AAAGGAAGAT
GCAGAGACAT TTGCAGACAT TATAAAAGTA GGAACCTCGG CCGGTGGGGC ACGTGCCAAG
GCTGTGATTG CCTATAACGA GCAGACCGGT GAAGTGCGCT CCGGACAATT GCATTCGCCT
GCAGGCTTTG AGCATTGGCT GATCAAGTTT GATGGTGTAA CAAATGAACA GCTCGGTGAT
CCGAAAGGGT ATGGAAGGAT TGAATACGCG TATTACAAGA TGGCTATTGA TGCCGATATT
ATAATGATGC CAAGCCTGTT GCTGGAAGAG GGTGGCCGCG CACATTTTAT GACCAAGCGG
TTTGACCGCA TTGTCAACAA TGAAAAGCTG CACATGCAGA CCTTATGCGG CTTGTGTCAT
TTTGATTATA ACAATCCGGA AGCCTATGCA TATGAACAGG CATTTCAGGC CATGCGCCAG
TTACGCTTGC CCTATACCGA TGCGGAACAA TTGTACATAC GTATGGTATT TAATGTAGTG
GCGCGCAACC AGGACGATCA TACAAAGAAC ATCTCCTTTT TAATGGACAA AACGGGAAAC
TGGAGTTTAT CGCCGGCCTA TGACGTAAGC TATGCATACA ACCCGGAAAA TAAGTGGATC
GCCAAACATC AGTTAGCGGT AAATGGTAAG CGGGAAAATA TTATGCGTGA AGATCTGTTA
TCGGTTGCGA AACAGATGAA CATTAAAAAG CCGAAAGAGA TCATTGAAAA AATAACAGCG
GTAGTAAGTA ATTGGGAAGG ATATGCACAG GAAGCAGGTG TTCCCAAGAA TCAAATTACG
GCATTGGGCA AAACACATTT GCTGAAAATG TAA
 
Protein sequence
MVVASVVLWG KHVGAVLWNY DKGYATFEFE AGFLKSGLDV APLTMPLANV SKGEIFQFAQ 
LPKETFHGLP GLLSDALPDR FGNQLIDLWL ASQGRDKASM SPVERLCYLG TRGMGALEFE
PTVRNEKEPS KGLEIGALVE LSKKALSMKA SLDSHFSKED AETFADIIKV GTSAGGARAK
AVIAYNEQTG EVRSGQLHSP AGFEHWLIKF DGVTNEQLGD PKGYGRIEYA YYKMAIDADI
IMMPSLLLEE GGRAHFMTKR FDRIVNNEKL HMQTLCGLCH FDYNNPEAYA YEQAFQAMRQ
LRLPYTDAEQ LYIRMVFNVV ARNQDDHTKN ISFLMDKTGN WSLSPAYDVS YAYNPENKWI
AKHQLAVNGK RENIMREDLL SVAKQMNIKK PKEIIEKITA VVSNWEGYAQ EAGVPKNQIT
ALGKTHLLKM