Gene Hneap_1483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1483 
Symbol 
ID8534641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1604157 
End bp1605425 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content52% 
IMG OID646383874 
ProductCapsule polysaccharide biosynthesis protein 
Protein accessionYP_003263362 
Protein GI261856079 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAGT TGGCATGGAA TCAATCAGCA AAAGCATCTC GGCATTTTGT TTTTTTGCAG 
GGTATGCCAT CTGCTTTTTT TCGTCTTGTG GGCGATGCGC TCGAAAAGAG TAGCCATCGG
GTGAGTCGCA TTAACTTTTG TGCGGGCGAT TGGTTGTTTT GGCATGATCG GCGCGCGTTG
AGTTATCGCG GCACCTTGGC CGATTGGCCG AACTTTTTTG GCCAATGGCT GCACGATGCC
CACGCCACCG ATGTGGTGCT GTTGGGCGAG CAACGCAAAT ACCACCGCGA GGCTGTTCAG
GTCGCCAAAG CGGCGGGCGT TCGGGTGTGG GCGACCGACT TTGGTTATCT GCGACCCGAC
TGGATTACGC TTGAAGCCAA TGGCTTGGGG GGTAATTCGA CCATGCCGCG CGATCTGGCT
GAAATTGAGC GATTGGCGGC AAATTTACCC CCCGTGGATT TTCAACGGAA ATATCACGAC
AGCAGTTGGC GCATGTCGCT GGGCGATTTG GCGGCCAGTT TTGCGACGGT GTTTTTTTCT
GTGTTTTATC CCCGTTATCA ACAATCGGAT GCCCGCCCGC ATCCTTTGAT TTATTTTCCG
GCGATGGGGT TAAGTTTGTT GGTTAAGGCG TGGCAGCAAA AACCCGCGCT GCGCCTATTT
GCACATATCA ATAGGCGGGG CAGGGCGTAT TTTGTGTTCC CCTTGCAACT TAATCACGAT
TTTCAAATTC AGGCCTATTC GCCCTTTGCG GGTATGGGCG ATGCCATCTC GCTTGTGTTG
GCATCGTTTG CGCGCCATGC CCCAATAGAA TGTGACTTGC TGATTAAAAG CCATCCTTGG
GATCCGGGGT TGCACAACTG GAATAAACAG ATTCAAAAAG AAGCCCGTCA GTTGGGTATT
GCCGAGCGGG TGTTTTATTT CAATGGTGGC GATTTGAATG CCATGATGCG CAAGGCGCGG
GGCGTAGTGA CGGTTAATAG CACATCAGGA TTGCAGGCGT TACAGATGGG GCGAGCGGTT
AAGGTGCTGG GTGCCTGTGT GTATGATGTG CCGGAGCTGG TGGATACCCA AAGCCTGGAT
GATTTTTGGC AGAACCCGAC ACCGCCGGAT GCCACGCAAT TAGCCGACTT TATTCGTTTA
TTGGTGCAGC AAACGCAAAT TCGCGGCGTG TTTTTTGGCC GCGCGGATGG CTCGCCAAGC
GTTATCAATT TTGCCCAACG ACTCACAGAA CCGTCTCATT TAACTCAGGA ACTGATTAAT
CATGCCTGA
 
Protein sequence
MSKLAWNQSA KASRHFVFLQ GMPSAFFRLV GDALEKSSHR VSRINFCAGD WLFWHDRRAL 
SYRGTLADWP NFFGQWLHDA HATDVVLLGE QRKYHREAVQ VAKAAGVRVW ATDFGYLRPD
WITLEANGLG GNSTMPRDLA EIERLAANLP PVDFQRKYHD SSWRMSLGDL AASFATVFFS
VFYPRYQQSD ARPHPLIYFP AMGLSLLVKA WQQKPALRLF AHINRRGRAY FVFPLQLNHD
FQIQAYSPFA GMGDAISLVL ASFARHAPIE CDLLIKSHPW DPGLHNWNKQ IQKEARQLGI
AERVFYFNGG DLNAMMRKAR GVVTVNSTSG LQALQMGRAV KVLGACVYDV PELVDTQSLD
DFWQNPTPPD ATQLADFIRL LVQQTQIRGV FFGRADGSPS VINFAQRLTE PSHLTQELIN
HA