Gene Hoch_0213 details


Gene Information

Locus tag: Hoch_0213
Symbol:
ID: 8542592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 314182
End bp: 315447
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 66%
IMG OID: 646385009
Product: hypothetical protein
Protein accession: YP_003264747
Protein GI: 262193538
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4966] Tfp pilus assembly protein PilW
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
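
The coordinates and lengths listed in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence does end in TGA). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 314182
end_bp = 315447
gene_length_bp = 1266
protein_length_aa = 421


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the span 314182..315447 covers 1266 bases, which is 422 codons: 421 residues plus the stop.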


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGAC ACCTCTCCAT GCACCCGACC GCCGCACATC GCCGCACGCA CGACCAGCGA 
GCGAACGAGC GCGGCCTCAC CATGGTCGAG CTGCTGATCA CCTTGGCCGT GGGCTCGATG
CTGCTCGGCT TCGTCTTCAA CATCCACGGA CGCATGTCGC GCGCGTATCG CTCGCAGACC
AGCGTGGGCG AGCTACAGCA GTCCACGCGC GCGGCCGCGG ACCTCATGGC GGCCCATGTG
CGCACGGCCG GTTTCATGAT GCCAAAGGGC GCCTACGTGA GCGCCGCCAT CGCAGCGCTT
GATGCCGGAC GAGCGCCCGA AAACCGGGGT ATTGTGGATA TAGACAGCGA CGGTGCCCCC
TTCGTGCCGG CGCTGGAGAT CGCCAACAAC CCGACGGGCA TGGTCCCGGG ACGCATGGAG
CCCGACCGGC TTCACACCTT CTCGGCCAAC TCTTTCGGAC AGACGACCGT CATCGCCGAC
CCGGGTCTCG ACACCATCAC GGTCGCCAAT GCCGGCTTCT TCAACGGCTA CACCGGGCTC
GCGCTCATCA CCAAAGGGCC GGTTCAAAAA CCGCACCCGC AACTCGGCAC GATGCCCGAT
ATCACCACCT ACAACGCCTG TATCGTACGG GTGACCGGTG CGGCTGCGAC CCAACTGTTC
TTCTTCGACG AGGGCCAGTT CAACAGCGGC GGCCTCGCGC ACTGCCGCCT CGGCCCCGAC
ACCGCCGGTG AACCGCGGCG CGCGCTTGAG CCAGGCTCTC AGGTCTTCCG CCTGGACGCG
CGCGCCTTTC GAGTCGACGT AGACCCTGGC GACGACGCGA AAACCCAACT CGCCGCGCTG
CAGATCTCGG CCACCGGCGG CCTGCTCGAC GACTGGCTCA CCGTGGGCGT CGGCTACGTC
GATCTGCAGC TCACGCAGCG CATCAGTGAA CCCTCCAACG GAGACGCTGA CGACCTGGAC
AACGACGGAG ACCTCCGCGC CGACTGGTAC GCGGCACCGT ACACGCTCGC CACGAATGAA
GCCATCACAC AGGTCGGCAT CTCGGTCGTC GTGCGCTCCG ACAGCAACGC GTCGGACACT
CAGACCCTCA TCGTCCCCAC GCCGCGCAAT GAGACCGGCG GTTTCACGGC GGCCAACAAC
CCCGAAGGCG ACGCCAGCGA CAAGAGCTTC TCGCCGACCG CGCCGCCTCC CGAATACGCC
GGGGAACACA TTTATCGCCA GAGTTCGCTC ATCGTAGACG TGCGCAACCT GGGCGCGTCA
TACTGA
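
The gene sequence is printed in groups of ten bases, so the spaces and line breaks need to be stripped before any computation. The following is a minimal sketch of doing that and estimating GC content. The helper names `clean_sequence` and `gc_percent` are illustrative assumptions, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed (with group spacing).
first_line = "ATGAGCCGAC ACCTCTCCAT GCACCCGACC GCCGCACATC GCCGCACGCA CGACCAGCGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1266-base listing through the same two functions should give a value comparable to the GC content reported in the record, allowing for rounding.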
 
Protein sequence
MSRHLSMHPT AAHRRTHDQR ANERGLTMVE LLITLAVGSM LLGFVFNIHG RMSRAYRSQT 
SVGELQQSTR AAADLMAAHV RTAGFMMPKG AYVSAAIAAL DAGRAPENRG IVDIDSDGAP
FVPALEIANN PTGMVPGRME PDRLHTFSAN SFGQTTVIAD PGLDTITVAN AGFFNGYTGL
ALITKGPVQK PHPQLGTMPD ITTYNACIVR VTGAAATQLF FFDEGQFNSG GLAHCRLGPD
TAGEPRRALE PGSQVFRLDA RAFRVDVDPG DDAKTQLAAL QISATGGLLD DWLTVGVGYV
DLQLTQRISE PSNGDADDLD NDGDLRADWY AAPYTLATNE AITQVGISVV VRSDSNASDT
QTLIVPTPRN ETGGFTAANN PEGDASDKSF SPTAPPPEYA GEHIYRQSSL IVDVRNLGAS
Y