Gene Hneap_1910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1910 
Symbol 
ID8535068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2043235 
End bp2045103 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content54% 
IMG OID646384291 
Productglycosyl transferase family 2 
Protein accessionYP_003263779 
Protein GI261856496 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00799183 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTAA ATACTGGGAG TTGTTCGCGG GCCGAGCACT ATTCGGATGC CTCCCCCAGA 
GCCCTGTTTG CAGTGAGTTC GCTGGGGCTG GGACATGCCA CCCGCAGCCT CGTTTTAATC
CATGCGTTTC TGGCCGATGG CTACCGGGTT ACGGTTATTT CAACGGGCAA TGCGTTGGCC
TTCATGCGCC TGGAGCTGGC CGATCACCCC GCTGTCGAAT GGCAGGATTG GCCTGATTAT
CCACCGCTGG AACGCGGGAC TGGTTGGCGG TTCTACGCTT ATCTTGCGCT TGATCTGTTC
ACCACTTGGC GACGAATCCG CGAAGAACAC CATCGGTTTG AATCCATTGC CTGTGATTAT
GATTTTGTAT TCAGCGACGG GCGTTACGGC ATGTACAGTC GCTGGGTTCC CTCATTCATT
CTGTCTCATC AAATTGCCTT CATTCCGCCC AAGGGGTTGC AGGAAGTCAG CTGGATGAGC
GATCACCTCA ACGTGGCCGC GCTCAAGCAG TTCGACCGGA TTTTTATTCC GGATTTCCCC
TGCCCGAGCC TGAATCTAGC CGGGAATCTT TCACATACGC CCGTTTTGCA GAACACCCGC
CGTGAATATG TCGGGATTCT GTCGTCCTAT CCGCATCAGG AGCTTGAGCA GGATATCGAC
TACCTGTTCG TGATCAGCGG CTATCTGCAC GAGCACAAGG GGGCGTTTGT CCGTGATCTG
CTTGAGCAGG CGCAAGCTTT GCCCGGCAAG AAGGTTTTTG TCCTTGGGGA TGCGAACGCT
GATCCGGCAC AGTATCGGGA CGCACTCTCT GACGATCTCG AAGTGCATCC GCTGGCGACG
GGTGATCTAA GGCAGGAATT GTTCGGTCGT GCCAAATGCA TTATCTCTCG AGCCGGGTAC
ACCACGGTGA TGGATCTGGT CGAGCATGAT AAGCGCGCGC TTCTTATTCC AACGCCGAAT
CAAACGGAGC AGGAATACTT GGGCTACTAC CTCGGTAGTC TTAAATATTT CGTCAGCCGG
AATCAGTCCG AAGCATTCGA TCTTGCCGCC GCTTTGGCGC AAACAGAAGA CACGCGGCTC
TTTGAGGCAC CCTGGCGGAC GCAGGAAGCG GTGCGCCGCA TACAGTCATC AATGAGCGAT
GCGCTCCACA AGAACTTTTT CTCGGTGATA GTGCCCGCAC ATAACGAGGC GGCGGTACTG
GCGGAAACGT TGGACAGTCT CCTTGCACAA AGCTATCCAG CAAATCGAAT GGAAATCATC
ATCGTGGAAA ATGGCTCAAC CGATGAAACC TGGGCGATTG CTGAACGGTA TGCGGCGCTA
TCTGACGGCA ACCTGTCGAT TCGTGCTCTG CAAAGCGAGA AGGGCGTTTC CAAGGCCAAG
AATGTGGGCT TGGCGGCCAT GAGTGTGCAT TCTGATTGGG TAATCTTTTG TGATGCCGAC
ACGCAGTTGG CACCCAAAGC CTTGCGTCAG TTCAACCGCT GGATCAATCA GCATGGCTCC
GAGAGTCTCG CAGTGGGAAC AACGCGTGTA CGACCCCTGA GCGTTAATCG TGTGTATGTG
CGGTTGTGGT TCAGGGCCTA TGACTTGATT CATCGACTGA CGCGCAGCTC TTACTCCATA
CAGTTGGCGC GTTCGCCAAT TGCGCGCGGG ATCGGTTTTC GTCCGGAATT GAGTTTTGCA
GAAGATCTGA CATTCATCAG TGAGTGTCGT CGCTATGGCC GGTTTTTCTA TATTCCCAGC
GATCAGGTGG CGACATCGAC CCGGCGATTT GAAGCCCAAG GCTATCTGAA GCAAAGCCTG
AAATGGTTGT TCGAGGCGCT GATGCCCATG CGGATGAAGC GGAAACGAGG ATACGATGTC
ATTCGCTGA
 
Protein sequence
MNVNTGSCSR AEHYSDASPR ALFAVSSLGL GHATRSLVLI HAFLADGYRV TVISTGNALA 
FMRLELADHP AVEWQDWPDY PPLERGTGWR FYAYLALDLF TTWRRIREEH HRFESIACDY
DFVFSDGRYG MYSRWVPSFI LSHQIAFIPP KGLQEVSWMS DHLNVAALKQ FDRIFIPDFP
CPSLNLAGNL SHTPVLQNTR REYVGILSSY PHQELEQDID YLFVISGYLH EHKGAFVRDL
LEQAQALPGK KVFVLGDANA DPAQYRDALS DDLEVHPLAT GDLRQELFGR AKCIISRAGY
TTVMDLVEHD KRALLIPTPN QTEQEYLGYY LGSLKYFVSR NQSEAFDLAA ALAQTEDTRL
FEAPWRTQEA VRRIQSSMSD ALHKNFFSVI VPAHNEAAVL AETLDSLLAQ SYPANRMEII
IVENGSTDET WAIAERYAAL SDGNLSIRAL QSEKGVSKAK NVGLAAMSVH SDWVIFCDAD
TQLAPKALRQ FNRWINQHGS ESLAVGTTRV RPLSVNRVYV RLWFRAYDLI HRLTRSSYSI
QLARSPIARG IGFRPELSFA EDLTFISECR RYGRFFYIPS DQVATSTRRF EAQGYLKQSL
KWLFEALMPM RMKRKRGYDV IR