Gene Sare_1731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1731 
Symbol 
ID5703430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2003236 
End bp2004765 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content71% 
IMG OID641271234 
Productamino acid permease-associated region 
Protein accessionYP_001536609 
Protein GI159037356 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.177585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00117638 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCCTCGA CAGCAACTAT CGAACGACCG AGCAACGTCT CTGAGGCGCT GGCCCGCGGC 
CGGCTCGGCG TGCCATCGGT GGTCTTCTTC GTCCTCTCCG CGGCGGCGCC ACTGACCGTG
GTGGCGGGTG TGGTGACCAC CGGCTACGGC GTCATCGGGG TGCTCGGCAT ACCGCTGGCC
TTCCTCGCTG TCGCCGCCCT GCTCGCCCTC TTCTCGGTCG GCTACGTGGC GATGGCCCGC
CGGCTGGCCA ACGCCGGCGC CTTCTACTCG TACGTCTCGC GGGGACTGGG TCGGCCGGCC
GGGGTCGGCG CCGCCTGGGT GGCGCTGATC GCGTACAACG CGCTGCAGGT CGGGTTGTAC
GGCGCCATCG GCGCCGCCGC CGAGCCCGTC CTGGACCGGT TGTTCGGCGT CAGCGTGCAG
TGGTGGCTGG TGGCCCTCGG CGCCTGGGCG GTTGTCGCCG TGCTGGGCCT GCTCCGGGTC
GACATCAACG GCCGGGTGCT CGCGGTGCTG TTGCTCGCCG AGATCGCGGT GATCCTGGTG
TTCGACCTCG GTCAACTGGG CAACCCCGCC GGCGGCCAGG TCAGTTTCGC CGCGTTCGCA
CCGGACAACC TCTTCGTCCC GGGCATCGGC GCGGTGCTGG TGCTGGCGAT CCTCGGGTTC
GTCGGGTTCG AGTCGGCTGT GGTCTTCAGC GAGGAGAGCA AGGATCCCCG CCGCACGGTG
CCGATGGCCA CCTACCTGTC CATCGCGATC ATCGCAGGGC TCTACGCGCT CTCGTCGTGG
AGTATGACCG TCGCGGTCGG GCCGGACCAG ATCTCCGAAC AGGCCGGCGA GCAGAGCGTC
GCGCTGATCT TCAACCTGGC CGGTGAGCAC CTCGGTGACA CCGTCGTGAC CATCGGCCAG
GTGCTGTTCC TGACCTCGGT GGTAGCCGCC ATGATCTCCT TCCACAACAC CACCGCCCGG
TACGCGTTCG CCCTCGGCCG GGAGCGGGTG CTGCCGGCGG CCTTCGGGCG GACCTCGCCC
CGCAGCGGCG CCCCCCGCAC GGCGTCCCTC GCGCAGAGCA CGCTCGGCCT GGTGGTGATC
CTGCTGTACG CGGTCAACGG GTGGGATCCG ATCGTGCAGC TCTTCTACTG GTGCGGCACC
AGCGGTGGGT TCGGGGTGCT GCTGCTGATC GCCACCACGT CGGTCGCGGT GATCGCGTAC
TTCGCTCGCA CCGGCGTGGC GGAGACGCTG TGGCGACGGG CGGTGGCACC AGGGCTCGCC
ACGGTGGCGC TGCTGGGTGT GCTCACCCTA GCGTTGATCA ACTTTGCCGA CCTGCTCGGC
GTGGCACCGG ACCACGCCCT GCGCTGGGGG GTCCCGGTCG CGTACCTGGC TGCCGCGCTG
CTCGGTGTGG TGTGGGGGCT GGTGTTGCGG TCGAACCGTC CGAGTACCTA TGCCCGGATC
GGGCTCGGCG CGGAGAGCGC CGCCGCCACG GTTCGGCCCG AGACACCGAC GCTGCCGGCG
GTGCCGGATC GGACGGCGGC CCCGCGGTGA
 
Protein sequence
MSSTATIERP SNVSEALARG RLGVPSVVFF VLSAAAPLTV VAGVVTTGYG VIGVLGIPLA 
FLAVAALLAL FSVGYVAMAR RLANAGAFYS YVSRGLGRPA GVGAAWVALI AYNALQVGLY
GAIGAAAEPV LDRLFGVSVQ WWLVALGAWA VVAVLGLLRV DINGRVLAVL LLAEIAVILV
FDLGQLGNPA GGQVSFAAFA PDNLFVPGIG AVLVLAILGF VGFESAVVFS EESKDPRRTV
PMATYLSIAI IAGLYALSSW SMTVAVGPDQ ISEQAGEQSV ALIFNLAGEH LGDTVVTIGQ
VLFLTSVVAA MISFHNTTAR YAFALGRERV LPAAFGRTSP RSGAPRTASL AQSTLGLVVI
LLYAVNGWDP IVQLFYWCGT SGGFGVLLLI ATTSVAVIAY FARTGVAETL WRRAVAPGLA
TVALLGVLTL ALINFADLLG VAPDHALRWG VPVAYLAAAL LGVVWGLVLR SNRPSTYARI
GLGAESAAAT VRPETPTLPA VPDRTAAPR