Gene Sare_1963 details

Gene Information

Locus tag: Sare_1963
Symbol:
ID: 5705210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2258281
End bp: 2259687
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 65%
IMG OID: 641271468
Product: amino acid permease-associated region
Protein accession: YP_001536839
Protein GI: 159037586
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
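
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon (the gene sequence listed in this record ends in TGA). The function names gene_length and protein_length are hypothetical helpers introduced only for illustration.

```python
# Coordinates copied from the record.
START_BP = 2258281
END_BP = 2259687


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1407 468
```

Under those assumptions the computed values agree with the Gene Length (1407 bp) and Protein Length (468 aa) fields.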


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.281005
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTTC ATGGTGACCT TGCTAAATAC GGCTATCGCC AGGAGTTGAG TCGACAACTC 
CGGTTCCGGG ACCTGCTGGC GTATGGGCTG GTGTATATGG TGCCGATCGC GCCGATGGCG
ATTTTCGGTA GTGTGTATGC CGGTTCTGGT GGCATGGTGG CACTTGCCTA TGTGATCGGT
GTGGTCGCGT TGGTGTTCAC TGCGTTTTCG TATGCGCAGA TGGTACGGGC GTTCCCGATG
TCGGGCAGCG TTTACAACTA TGCGGGTCGG GGTATCAGCC CTCCAGTCGG TTTCCTCGCC
GGGTGGGTGA TCCTGCTGGA CTATGTGCTC GTGCCGGGCC TGCTGTATTT GGTGGCGTCG
GTGGCGATGC ACGCGACCGT GCCAGTAGTG CCGGTGTGGT TGTGGCTGAT CGGGTTCGTC
GCGGTCAACA CGATCGTCAA CTCGGTCGGC ATCCGGATGA CCGCGATGGT GACCCGGGTG
ATGCTCGTCG GCGAGCTGAT CGTCCTGGCG ATCTTCCTCG CTGTCGCCGG CTGGGCCCTC
GCCTCGGGCA GGGGGCGGTT TAGTTGGGAG GCCTTCTACA ACGCCAACAC GTTCACCTGG
TCGCTTGTTG CCGGCGCCGT GTCGATCGCG GTGCTGTCCT TCCTCGGCTT CGATGGCATC
TCGATGCTGG CGGAGGAGGC CAAGGGCGGC TCTCGGCAGA TCGGTCGGGC GATGGCCGCT
GTGCTGGTCC TGGCTGGCGT GTTGTTCATC GCGCAGACGT GGCTGGCCGC GATGCTCGTT
GCCGAGCCGG CCTCCCTGCG CGGTGATGGG GATCCGGACG GCACGGCCTT CTACGAGGCG
GCTGCGGTGG CCGGTGGGGG CTGGCTGGCG ACCTTGTGCG CGGTCGCGAC CGCGATCGCA
TGGGGATTGC CGAATTCGAT GGTGGCGCAG GTGGCCACAT CGCGGCTGTT GTATGCGATG
GCCCGGGACC GGCAGTTGCC CGCCTTTTTG GCGAAGGTGT CGGTACGCCG CAGCGTGCCG
ATCAACGCGA CCCTGCTCAC CGGTGCCGTG TCTCTGGTGT TGGGCCTGTC CATGGCGGCC
CGGGCGGACG GGATCACACT GCTGTCGTCG CTGATCAACT TCGGGGCGAT GGTGGCGTTC
CTGGTCCTGC ACGTCAGCGT GATCGTGCAC CACCTCATCC GCCGGCGCAG CCGCAACTGG
TGGGCGCATC TGGTCATGCC CGCTGTCGGA TTCGCGATTC TCTCCTACGT CGTGGTCAAC
GCCGATATCG CCGCGCAGCG CCTCGGTCTG ACCTGGCTCG CCCTTGGGGT CCTTGTCCTC
GCCGGCCTGT ACCTGTCCGG TCGCCGGCCG GCCCTGTCGG GCCTGGCGCC CGCGCAGACA
CACGATCATG AGATGGAGCG AGTGTGA
 
Protein sequence
MSLHGDLAKY GYRQELSRQL RFRDLLAYGL VYMVPIAPMA IFGSVYAGSG GMVALAYVIG 
VVALVFTAFS YAQMVRAFPM SGSVYNYAGR GISPPVGFLA GWVILLDYVL VPGLLYLVAS
VAMHATVPVV PVWLWLIGFV AVNTIVNSVG IRMTAMVTRV MLVGELIVLA IFLAVAGWAL
ASGRGRFSWE AFYNANTFTW SLVAGAVSIA VLSFLGFDGI SMLAEEAKGG SRQIGRAMAA
VLVLAGVLFI AQTWLAAMLV AEPASLRGDG DPDGTAFYEA AAVAGGGWLA TLCAVATAIA
WGLPNSMVAQ VATSRLLYAM ARDRQLPAFL AKVSVRRSVP INATLLTGAV SLVLGLSMAA
RADGITLLSS LINFGAMVAF LVLHVSVIVH HLIRRRSRNW WAHLVMPAVG FAILSYVVVN
ADIAAQRLGL TWLALGVLVL AGLYLSGRRP ALSGLAPAQT HDHEMERV