Gene Sare_4122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4122 
Symbol 
ID5708102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4681951 
End bp4683222 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content72% 
IMG OID641273550 
Productmajor facilitator transporter 
Protein accessionYP_001538903 
Protein GI159039650 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACCG TTCTGCGTCG CCCGGACTTC CGGCTGCTCT TTGGTGGCCT ACTGGCCAGC 
ATGACGGCCG AGTCGATCCT GCTGCTCGCA CTCGCCGTCT GGGTCAAGGA GTTGACCGGA
TCGAGCGGAC TGGCCGGGGC CACCATCTTC GCGATCATCG CCCCGATGAC GCTGGCGCCC
CTGGTTGGCT GGATCGTCGA CCGCTACCCG CGCCAGCCCC TGTTCGTGGC CGTCAACCTG
GTCACCGCCA CCCTGCTCAC CCCGTTGTTC GCCGTCCGCG ACCGCACCGA CCTCTGGCTC
GTCTACCTGG TCGCGATCCT CTACGGCCTG TCCTACGTCA CCCTGAGCGC GGTGCTCAGC
GGCCTCATCC GTAGCCTGGT CCCGGCGGAG CTGCTGGCCG ACGCGAACGG CGTGCTGCAG
ACGGTGCGGC AGGGCCTGCG GCTGATCGGC CCGCTGGCCG GGGCCGCCCT CTACTCCACC
GCCGGCGGTG GACTGCTGAC CGGGGTAGCG GTGACCGGCT TCGTGACCGC AGCGGTGCTG
GCGGGTCTGC TCCGGGTGAC CGAACCGCCC TGGTCTCGAC CGGAACCGCG GCGCTGCGTC
CGTACCCGCT CGTCGCCTCG GTCCGCCGAG CTGGGCGCCG GCCTTCGGTA CCTGGCCGAC
GAGCCGGCGC TGCGCCGGGC CCTGCTCGGC TACGGGCTCG GCTCGTTGGT GATGGGCTTC
AGCGAGTCAC TGATCTTCGC GTACGTCGAC CAGGGGCTTC GCCGTGACGC GACGTTCGTG
GGCGTACTCG TCACGGTGCA GGGAGCCGGC GGTCTAGCCG GCGGGCTGGT GTCCCCAGGT
GTGATCCGGC GGGCGGGGGA GGTGGGCGCG CTCGCGGTGG GGGTGGCCCT GTTCGGGGTG
GCCGCGCTGG CGTTGTCCTA CCCGAACCTG TGGCTGGGCT GCACCGCGGT CCTGCTGGCC
GGGGCTTCCC TGCCGCTGAC CATGGTCGGG CTGCACACGC TGATCCAGCG GCGCACCCCG
CCACGGCTCA TCGGACGGGC CGCCGCCGGT GCCGAGGCGG TGGTCAGTGG CCCACGGGCA
GTGTCCATCG GCGTCGGCGC ACTGCTCGTG GGGGTCATCG ACTACCGTCT GCTGTTCGTG
GTGGTGAGCC TGGTCACCCT ACTCGCCGGC GGCTACCTCT GGGGCGGTCG CCGGTTGAGC
CGACCACCCC AGCCACCGCC GCCCGCAGGG CGCACACAGG TCAGCCAACA ATCGTGGACA
GGTGCTCCTT GA
 
Protein sequence
MRTVLRRPDF RLLFGGLLAS MTAESILLLA LAVWVKELTG SSGLAGATIF AIIAPMTLAP 
LVGWIVDRYP RQPLFVAVNL VTATLLTPLF AVRDRTDLWL VYLVAILYGL SYVTLSAVLS
GLIRSLVPAE LLADANGVLQ TVRQGLRLIG PLAGAALYST AGGGLLTGVA VTGFVTAAVL
AGLLRVTEPP WSRPEPRRCV RTRSSPRSAE LGAGLRYLAD EPALRRALLG YGLGSLVMGF
SESLIFAYVD QGLRRDATFV GVLVTVQGAG GLAGGLVSPG VIRRAGEVGA LAVGVALFGV
AALALSYPNL WLGCTAVLLA GASLPLTMVG LHTLIQRRTP PRLIGRAAAG AEAVVSGPRA
VSIGVGALLV GVIDYRLLFV VVSLVTLLAG GYLWGGRRLS RPPQPPPPAG RTQVSQQSWT
GAP