Gene Sare_4657 details

Gene Information

Locus tag: Sare_4657
Symbol:
ID: 5705714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5276875
End bp: 5277966
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 67%
IMG OID: 641274055
Product: ABC transporter related
Protein accession: YP_001539401
Protein GI: 159040148
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
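
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(5276875, 5277966)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```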


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.400776
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.472803
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCACGG TTACCTACGC CAAGGCGTCC CGGATCTACC CGGGCACCGA GCGGCCCGCC 
GTCAACCAGC TCGACCTGGA GATCGGCGAC GGCGAGTTCC TGGTCCTGGT CGGCCCCTCA
GGTTGCGGTA AGTCCACCAG CCTGCGCATG CTCGCCGGCC TTGAGGACGT TGACGAAGGT
TCGATCTCCA TCGACGACCG GGACGTCACC CACCTGCCGC CGAAGGCCCG CGACATCGCA
ATGGTCTTCC AAAACTACGC CCTCTACCCG CACATGTCGG TGTACGAGAA CATGGCGTTC
GCCCTCAAGC TGCGCAAGAC GCCAAAGGCG GAGATCGACC GGCGGGTCAA GGAAGCGGCG
ACGCTGCTCC AGTTGGAGGA GTACCTCGGC CGCAAGCCGA AGGCGCTCTC CGGCGGCCAG
CGCCAGCGGG TGGCCATGGG CCGAGCGATC GTCCGCGAGC CGCAGGTCTT CCTGATGGAC
GAGCCCCTGT CGAACCTCGA CGCCAAGCTG CGGGTGCAGA CCCGCACGCA GATCGCGTCC
CTACAGGCCA AGCTCGGTGT GACCACCGTC TACGTCACCC ACGACCAGGT TGAGGCCATG
ACCATGGGTC ACCGGGTGGC GGTCCTGCTC GACGGCGAAC TCCAACAGGT CGACACGCCG
CGGGCGCTCT ACGACACCCC AGCCAACGTC TTCGTCGCCG GATTCATGGG CTCGCCGGCG
ATGAACATCA AGACCGTGCC GCTGAGCGAG AATGGTGCCG AGTTCGCCGA GATGCACATC
CCGCTGACCC GCGAGCAGGT CGAGGCGGCC CGCGCCGAGG GTGGTGACGA CAAGGTGGTG
GTGGGCTTCC GCCCGGAGGA CTGCGAACTG GTCAGCCCGA CCGAGGGGGG CATGCCGGTC
GTCGTCGAGC TGGTTGAGGA CCTCGGATCG GACGCGAACA TCTACGGCCA CGCCGCGTTG
GAAGGCGCCA ACGAACGGTT CGTGGTGCGC ACCGACCGGC GCACCATGCC CAACATGGGT
GGCACCGTGT TCGTCAAGCC GCGGGCCGGC CGCAGCCACG TCTTCAACGC GAAGACCGGC
CGCCGGATCT GA
 
Protein sequence
MATVTYAKAS RIYPGTERPA VNQLDLEIGD GEFLVLVGPS GCGKSTSLRM LAGLEDVDEG 
SISIDDRDVT HLPPKARDIA MVFQNYALYP HMSVYENMAF ALKLRKTPKA EIDRRVKEAA
TLLQLEEYLG RKPKALSGGQ RQRVAMGRAI VREPQVFLMD EPLSNLDAKL RVQTRTQIAS
LQAKLGVTTV YVTHDQVEAM TMGHRVAVLL DGELQQVDTP RALYDTPANV FVAGFMGSPA
MNIKTVPLSE NGAEFAEMHI PLTREQVEAA RAEGGDDKVV VGFRPEDCEL VSPTEGGMPV
VVELVEDLGS DANIYGHAAL EGANERFVVR TDRRTMPNMG GTVFVKPRAG RSHVFNAKTG
RRI
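
The gene and protein sequences above can be related to each other with a minimal sketch, assuming the sequence is pasted in the space-separated 10-base blocks shown on this page. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons; since this gene begins with ATG, that difference is ignored here. The helper names are illustrative only.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGCCACGG TTACCTACGC CAAGGCGTCC CGGATCTACC CGGGCACCGA GCGGCCCGCC"
print(translate(first_line))  # MATVTYAKASRIYPGTERPA
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and GC content listed in this record.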