Gene Sare_4177 details


Gene Information

Locus tag: Sare_4177
Symbol:
ID: 5703965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4745173
End bp: 4746111
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 66%
IMG OID: 641273604
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001538957
Protein GI: 159039704
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
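
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record does end in TGA). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4745173
END_BP = 4746111


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                   # compare with the Gene Length field
print(protein_length(length_bp), "aa")   # compare with the Protein Length field
```

Under these assumptions the results agree with the record: 939 bp and 312 aa.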


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0016505
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTCCGAT TCATTCTGCG ACGACTACTC CAGATGGTCC TCGCCTTCTT CGGGACCACG 
CTGATCGTCT ACGCGTTGAT GTTCGCCGGC CAGGGAGACC CGATCCAGGC GCTCGCCGGC
GAACGCCCGG TCACCGAGGC TCAGCGGGCG TACCTGACGG AGAAGTTCCA CCTGGACCGG
ACCGGAATCG ACGGCTTCTT CTTTCGCTAC TTCGACTACA TCCGGAATCT CCTCCAGGGC
GATCTCGGCG AATCCCTCAC CGGCCGGCAG ATCGGCGACA TCCTCGCCGC CGCGTGGCCC
GTGACGATCA AGCTGGCACT GATCGCGATG ACGGTTACCG TGCTCTTCGG TGTCACCGCC
GGCGTACTCG CCGGAATCCG GCGGGCCAGC ATCTTCGACA ACTCGACGCT GCTGCTGACC
CTGATCGTGC TCGCTGTGCC GACCATCGTG CTGGCGCCTC TCGCGCAGTA CCTCCTCGGC
GTCCGGTGGC GACTCTTCCC GCCGACCGCC GGCTCCGACC CCGACTTCTA CGCACTCCTG
CTACCCGGGA TCGTGCTCGG TTCCCTCTCC CTGGCCACCG CCCTACGGCT GACCCGGACC
TCGGTGGCCG AGAACCTCCG CGCCGACTAC GTCCGCACCG CCCGGTCGAA GGGCCTGGTC
AAGCGACGGA TCGTCGGCAT CCACGTCCTG CGCAACTCAC TCATCCCGGT GGTCACCTTC
CTCGGCGTCG AGCTGGGCAA CCTGATGGGC GGCGCGATCA TCACCGAAGG GGTGTTCAAC
ATCCCCGGAG TCGGCTTCAA CCTCTTCCGC GCCATCCGCA CCGAGGACGG TCCGCTGGTG
GTGGGCATCG TCAGCGTGCT GGTCGTCGTC TACCTCGTCG CCAACCTGGT AGTGGACGTG
CTCTACGCCG TACTCGACCC GAGGATCCGC TATGAGTGA
 
Protein sequence
MVRFILRRLL QMVLAFFGTT LIVYALMFAG QGDPIQALAG ERPVTEAQRA YLTEKFHLDR 
TGIDGFFFRY FDYIRNLLQG DLGESLTGRQ IGDILAAAWP VTIKLALIAM TVTVLFGVTA
GVLAGIRRAS IFDNSTLLLT LIVLAVPTIV LAPLAQYLLG VRWRLFPPTA GSDPDFYALL
LPGIVLGSLS LATALRLTRT SVAENLRADY VRTARSKGLV KRRIVGIHVL RNSLIPVVTF
LGVELGNLMG GAIITEGVFN IPGVGFNLFR AIRTEDGPLV VGIVSVLVVV YLVANLVVDV
LYAVLDPRIR YE
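
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted in with the 10-base spacing used above, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "ATGGTCCGAT TCATTCTGCG ACGACTACTC CAGATGGTCC TCGCCTTCTT CGGGACCACG"
last_line = "CTCTACGCCG TACTCGACCC GAGGATCCGC TATGAGTGA"
print(translate(first_line))  # MVRFILRRLLQMVLAFFGTT
print(translate(last_line))   # LYAVLDPRIRYE
```

Both outputs match the corresponding ends of the protein sequence in this record. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.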