Gene Hneap_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_2036 
Symbol 
ID8535195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2179768 
End bp2181069 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content57% 
IMG OID646384417 
Productmetabolite/H+ symporter, major facilitator superfamily (MFS) 
Protein accessionYP_003263904 
Protein GI261856621 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.70971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAAC GCGACGACAC CCGCCCCAAG CTGCAAAACT CACCCAGAAA AGTCCTGTTT 
GCCAGCCTGA TCGGCACCAC CATCGAGTTT TTCGACTTTT ATATCTACGC CACTGCCGCC
GCTTTGGTGT TTCCAAAATT ATTCTTCCCT GAATCGGACC CAACGACGGC CGTCCTACAA
TCACTGGCAA CGTTTGCCAT TGCCTTCTTT GCCCGACCGG TCGGCGCTGC CCTGTTCGGG
CACTTCGGTG ATCGCGTCGG CCGTAAAGCC ACCCTGGTCG CCGCTCTGCT GACGATGGGG
CTATCGACCG TGGCGATCGG CCTGCTCCCG ACCTACGACA GCATTGGCGT CGTCGCCCCT
ACCCTACTTG CACTGTTTCG CTTCGGCCAG GGCTTGGGGC TGGGCGGCGA ATGGGGTGGC
GCGATCCTGC TGGCGACCGA AAACGCCCCG CCGGGAAAAC GCGCCTGGTA CGGCATGTTC
CCTCAATTGG GCGCGCCGAT TGGCTTCATT CTTTCCAGCG GCATCTTTCT GTTGCTGACC
GCGTTTCTGA CCGACCAACA ATTTTTCGAC TTCGGTTGGC GCATCCCTTT TCTTGCCAGC
GCGGCACTGG TGATCGTGGG GTTGTACGTA CGACTGAAAA TCACCGAAAC TCCTGCCTTT
CAGGAAGTGG TTGAACACAA CACACGGGTC AAAACACCGA TTGCCACGGT GTTTAAAAGC
CACTGGAAGC CCTTGATTGC AGGCACGGTC ATTGCCTTGG CGACATTTGT GACCTTCTAC
CTGATGACTG TGTTTGCACT GACCTACGGC ACGGCGAAAA ACGGTTTGGG CTACAGTCGT
GAAACCTTCC TGTTCGCCCA ACTGTTTGCG GTCTTGTTTT TTGCGATCAC CATTCCCGTT
TCCAGCCTGA TTGCGGATCG ATTCGGTCGG CGTTTGACGC TGATCGTGAT CACAATCGCG
ATCTTTTTGT TCGGCTTTGC GCTGGCCCCA CTGTTCGGTT CCGGCAATCT GACGGGCGTT
GTGGTGTTTC TGGTGCTTGG ACTGGGACTG ATGGGGCTGA CCTACGGTCC GCTCGGCACA
CTTCTCTCTG AGCTGTTTCC GACAGCCGTG CGCTACACCG GCACATCAAT GGCGTTCAAC
CTGTCCGGGA TTTTCGGCGC CTCGCTCGCA CCTTATGCGG CCACGTGGCT TGCCAGCCAC
TATGGGCTTA ACTATGTGGG CTATTATCTC TCGGCCGCTG CCGCACTGAC CTTGCTCGGC
CTGCTCTCCA TCAAGGAGAC CAAGGATAAA ACATTCCACT GA
 
Protein sequence
MDERDDTRPK LQNSPRKVLF ASLIGTTIEF FDFYIYATAA ALVFPKLFFP ESDPTTAVLQ 
SLATFAIAFF ARPVGAALFG HFGDRVGRKA TLVAALLTMG LSTVAIGLLP TYDSIGVVAP
TLLALFRFGQ GLGLGGEWGG AILLATENAP PGKRAWYGMF PQLGAPIGFI LSSGIFLLLT
AFLTDQQFFD FGWRIPFLAS AALVIVGLYV RLKITETPAF QEVVEHNTRV KTPIATVFKS
HWKPLIAGTV IALATFVTFY LMTVFALTYG TAKNGLGYSR ETFLFAQLFA VLFFAITIPV
SSLIADRFGR RLTLIVITIA IFLFGFALAP LFGSGNLTGV VVFLVLGLGL MGLTYGPLGT
LLSELFPTAV RYTGTSMAFN LSGIFGASLA PYAATWLASH YGLNYVGYYL SAAAALTLLG
LLSIKETKDK TFH