Gene Sare_4849 details


Gene Information

Locus tag: Sare_4849
Symbol:
ID: 5707628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5503942
End bp: 5505162
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 641274245
Product: protein-N(pi)-phosphohistidine--sugar phosphotransferase
Protein accession: YP_001539590
Protein GI: 159040337
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2213] Phosphotransferase system, mannitol-specific IIBC component
TIGRFAM ID: [TIGR00851] PTS system, mannitol-specific IIC component
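
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5503942
end_bp = 5505162

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1221 bp) and Protein Length (406 aa).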


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0104917
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAA GCAACTACAC ACCGACAGTC CAGGGCACCG GCGTCAAGGC CACCATCCAG 
CGGATTGGTG GCTTCCTCGC CGGCATGGTG ATGCCCAACA TCGGCGCCTT CATCGCCTGG
GGTCTGATCA CCGCCGTGTT CCTCGAAGAC GGCGGTTGGG TTCCCAACAA GGACTTCGCC
GCCCTGATCG GGCCGATGAT CAACGTGCTG TTGCCAGTCC TCATCGGCTA CACCGGCGGC
CGTCTCGTGT ATGGCCAGCG CGGCGCCGTC GTCGGTTCGG TCGCCACCAT TGGTGTCGTC
GTCGGATTCG AAAAGCCCAT GTTCCTCGGC GCGATGGTCA TGGCGCCGTT GGCCGCGTAT
CTGCTGAAGC GCATTGACGG GCTGTTCCAG GACCGGATCC GGCCCGGGTT CGAGATGCTG
GTCGACAACT TCACCGCCGG CATCCTCGGC GCGGGCATGG CCCTACTCGG CGTGTGGGCG
GTCGGCCCGA TCGTCGGTGG TCTGACCAAC CTCGCGGGCG ACGGTGTGGA CTGGCTGGTC
TCCCACCACA TGCTCGGCCT GGTGTCGATC ATCGTCGAGC CGGCGAAGGT GTTGTTCCTC
AACAACGCCA TCAACCACGG TGTGCTCAGC CCGCTCGGGG TGACCGAGGC CGCCGAGGCC
GGCAAGTCGA TCCTGTTCAT GGTCGAGACG AACCCGGGGC CCGGGCTCGG TCTGCTGCTG
GCGTTTTTCT TCTTCGGCCC GCGGTCACTG CGTCCGACCA CCCCGGCCGC GATGATCATT
CAGTTTTTCG GTGGCATCCA CGAGGTGTAC TTCCCGTACG TGCTGATGAA GCCCCGCCTG
ATCCTGGCGA TGATCGCGGG CGGTGCCGCC GGTGTCTCCA CCTTCATGAT TACCGGGGCC
GGTCTGGTAG CCGGCCCCTC ACCCGGCAGC ATCATGGCCT ACTTCGCGGT CACCCCGAAG
GGTGGCTGGT TCAGCATGCT TCTCGGCATT GTGATCGCCG CCGCGGTGAC CTTCGCAGTC
GCGGCCCTGC TGCTCGGATT CGGGCGGAAG GCGGGAGACG AGGCCGACGA CGCGGCCGTC
ACCCCGGAGG AGCAGGCCGA CCGCGAGCAG GCGGAACTGG CCGCGGCGCA GCGCCGCTCG
GCCGACAACA AGAACCTGGC GCCGGCCTCC CGCGGATCCG GCGACGACAC CGCGCCACAA
GCGACTGCGA AGGAGTCCTG A
 
Protein sequence
MTTSNYTPTV QGTGVKATIQ RIGGFLAGMV MPNIGAFIAW GLITAVFLED GGWVPNKDFA 
ALIGPMINVL LPVLIGYTGG RLVYGQRGAV VGSVATIGVV VGFEKPMFLG AMVMAPLAAY
LLKRIDGLFQ DRIRPGFEML VDNFTAGILG AGMALLGVWA VGPIVGGLTN LAGDGVDWLV
SHHMLGLVSI IVEPAKVLFL NNAINHGVLS PLGVTEAAEA GKSILFMVET NPGPGLGLLL
AFFFFGPRSL RPTTPAAMII QFFGGIHEVY FPYVLMKPRL ILAMIAGGAA GVSTFMITGA
GLVAGPSPGS IMAYFAVTPK GGWFSMLLGI VIAAAVTFAV AALLLGFGRK AGDEADDAAV
TPEEQADREQ AELAAAQRRS ADNKNLAPAS RGSGDDTAPQ ATAKES
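
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a hedged sketch rather than the method used by the database: it assumes the amino-acid assignments of NCBI translation table 11, which are the same as the standard code, and it treats the 10-base blocks and spaces as layout only. The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first line of the gene sequence is used as input here, and the record's 67% GC figure for the full gene is not recomputed.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove whitespace used for layout and upper-case the sequence."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above, as displayed in the record.
first_line = "ATGACAACAA GCAACTACAC ACCGACAGTC CAGGGCACCG GCGTCAAGGC CACCATCCAG"

print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

The translated fragment should correspond to the first 20 residues of the protein sequence listed above (MTTSNYTPTV QGTGVKATIQ). Passing the full gene sequence to `translate` would end in `*` for the terminal TGA codon.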