Gene Sare_4780 details

Gene Information

Locus tag: Sare_4780
Symbol:
ID: 5704447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5411156
End bp: 5412058
Gene Length: 903 bp
Protein Length: 300 aa
Translation table: 11
GC content: 68%
IMG OID: 641274178
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001539524
Protein GI: 159040271
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
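
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5411156
end_bp = 5412058
gene_length_bp = 903
protein_length_aa = 300

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons. The last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.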


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.684387
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000235996
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCACGC TGTCCGAGGC GGTCGGCCCG GCCGAGCCGA AGGACGCGCC ACCGCGCGGC 
CTGTGGCGGC AGGGGCTGAT CGTCTTCTGC GAGAACCGGC TTGCCCTGGT GGGGCTGTGT
CTGTTCGTCC TCCTGGCCGG GGTCTGTTTC CTCGGACCGT TGGTGTACCA GACCGACCAG
GTACACACGG ACCTCACCGC CGTACACCTG GCCCCGGGAG AGCAGGGACA TCCACTCGGC
ACCGACGGGG TCGGCTACGA CCAGTTGGGG CGGTTGATGC TCGGCGGCCA GACCTCCATC
ATCGTCGGAC TGGCTGCCGG GATCCTCGCC ACCATCGTTG GCACGCTTCT GGGCGCCATC
GCCGGCTTCG TCGGCGGCTG GGTGGACGCC GCGGTGATGC GCGTCGTCGA CGCGATGATG
TCGATCCCGT CGTTGTTCCT GTTCATGCTG CTCGCCGCCA TCGTCACACC GAGCGTGCCG
ATGCTCATCC TCATCATCGG CGCCTTCGCC TGGTTGGGTC CGGCCCGGCT CGTGCGAGGC
GAGGCACTGA CGCTGCGCTC ACGCGAGTAC GTCCAGGCGA TGCGCGGGAT GGGTGGCACG
GGCGGTCGTG CGGTCCGTCG ACACATCATC CCCAACGCCA TCGGCACGGT GATCGTCAAC
GCCACCTTCC AGGTCGCCGA CGCCATCCTC TACGTCGCCT ACCTGTCCTT CCTCGGCCTC
GGCGTCCCCC CACCGGCGGC GAACTGGGGT GGCATGCTCT CCGATGGTCT GGCGGACACC
TACAGCGGCC ACTGGTGGCT GTTGTACCCG CCCGGGATCG CCATCATCCT CATCGTCCTC
GCCTTCAACT TCATCGGTGA CGGGCTGAGG GATGCCTTCG AGGTTCGCCT CCGGCGACGC
TAG
 
Protein sequence
MSTLSEAVGP AEPKDAPPRG LWRQGLIVFC ENRLALVGLC LFVLLAGVCF LGPLVYQTDQ 
VHTDLTAVHL APGEQGHPLG TDGVGYDQLG RLMLGGQTSI IVGLAAGILA TIVGTLLGAI
AGFVGGWVDA AVMRVVDAMM SIPSLFLFML LAAIVTPSVP MLILIIGAFA WLGPARLVRG
EALTLRSREY VQAMRGMGGT GGRAVRRHII PNAIGTVIVN ATFQVADAIL YVAYLSFLGL
GVPPPAANWG GMLSDGLADT YSGHWWLLYP PGIAIILIVL AFNFIGDGLR DAFEVRLRRR
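
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 60 nucleotides of the gene. This is a minimal sketch, not the tool that produced the record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE`, `translate`, and `first_60_nt` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nucleotides, 20 codons).
first_60_nt = "ATGAGCACGC TGTCCGAGGC GGTCGGCCCG GCCGAGCCGA AGGACGCGCC ACCGCGCGGC"
print(translate(first_60_nt))  # MSTLSEAVGPAEPKDAPPRG
```

The output corresponds to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would be expected to stop at the terminal TAG codon.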