Gene Sare_3065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3065 
Symbol 
ID5706946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3473311 
End bp3474519 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID641272507 
Productmajor facilitator transporter 
Protein accessionYP_001537875 
Protein GI159038622 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000480105 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTCCCCT CGTCGATTTC GCCACTGATC CGCTATCTAC TGTCAGCGTC CCTGGTCCGG 
TGCGCCCAGG GTGGTGCCAT CGTCGGACTG GTAACGCTGA CCATCGAATC CGGCGTACGT
GGCGGAACAG CCCTCGCCGG GGTGCTCGCC GCGTTACTCA CCGCCCCGCA CGTCGCCGGC
CCCTGGCTCG CGCGGTGGCT AGACACAACC CGAGACGGCC GACTCGTCCT GGCCGGATCC
TTTGGCCTAT TCGGTATTGG CCTGGCCGCC GGGGCGCTGC TACTCGGTCG CGTACCTGTG
CCTGTCGTCG CGGTGCTGAT CGTCCTGGCC GGCCTTGCCG GGCCACTGAT GATGGGCGGG
CTCAGCAGCC GGCTGACCAG CGTCATCGGC GGAAGACAGT CGCTGCTACG ACGGGCTGAG
GGCTGGGACT CGGCCACCTA CGGCCTGGCG AACATCGTCG GCCCTGCTCT CGTGGCTGCT
GTCGGCGCCC TAACCGGGCC GCTGTACCCC GTATTGGCGC TATCCGTCGC GGCCGCGCTA
GCAGCGGTGC TCGTGCTATC GCTGCCCATC CGAGAGCAAG CCACCGCGGA GCCGGAAGCC
GCGCTCGGCG TCGGCAAGGC GCTGCGAACG TTGGTCACCA TCGACCCCCT GCGCCGCGTC
ATGACCGCCA CCACCCTCAC CGCGGTGAGC ATGGGCGGCA TCATGGTCAT CGCCGTGGTG
TTCGGCGCCG AACTAGCCGG GCACCGCAGC TCTGGCCCAG CGCTCGCCGT GGCGTACGGG
GTGGGCAGCC TATGCGGATC GGTCGCGGTG GCCCTGATGC CACTTCAGGG CGAGCCCGAG
CGACTGACTA TTAAACTCAT CGCCGCCAAC GCAGTGGCAG TCGCGCTATG TGCGGCAGCC
CCGAGCTACC TGCTGGCCCT TGTCGCCTTC GCGCTCGCGG GCGCGAGCAG TGCGATCCTG
TTCACCGCCA CCCTCGCCGT TCGGTCGCTC TACTCGCCAT CAGGCGCCCG CGCGCAGGTC
TTCGTCTCCA TGGCAGGGAT CAAAATGGTC GCCGCCTCGG TGGGCACCGC GTTGGCGGGC
ACACTCGTCG CCGTCGGCCC CCGGTTCGCG TTACTAATCG GAGCCACGAT CACCGCCGTC
GCGGTGATCG TGGCGCTGGT CGACCGCCGC GTCACCCACC CCAGCTCTAA AGTCGCTGGT
ACATCCTGA
 
Protein sequence
MLPSSISPLI RYLLSASLVR CAQGGAIVGL VTLTIESGVR GGTALAGVLA ALLTAPHVAG 
PWLARWLDTT RDGRLVLAGS FGLFGIGLAA GALLLGRVPV PVVAVLIVLA GLAGPLMMGG
LSSRLTSVIG GRQSLLRRAE GWDSATYGLA NIVGPALVAA VGALTGPLYP VLALSVAAAL
AAVLVLSLPI REQATAEPEA ALGVGKALRT LVTIDPLRRV MTATTLTAVS MGGIMVIAVV
FGAELAGHRS SGPALAVAYG VGSLCGSVAV ALMPLQGEPE RLTIKLIAAN AVAVALCAAA
PSYLLALVAF ALAGASSAIL FTATLAVRSL YSPSGARAQV FVSMAGIKMV AASVGTALAG
TLVAVGPRFA LLIGATITAV AVIVALVDRR VTHPSSKVAG TS