Gene Sare_4779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4779 
Symbol 
ID5704446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5410200 
End bp5411159 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content66% 
IMG OID641274177 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001539523 
Protein GI159040270 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000244558 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGCCCGGT ACCTCATCAC GCGCCTGGGA CAGGCACTCA TCGTCGTCGT ACTCGTCACG 
GTGATCGCGT TCCTCATCCT GCACCTGCTG CCCGGGGGCG CCGCACGAGC CACCCTCGGC
AAGGAGGCGA CGCTGGAGCA GTTGGCGGCG TTCAACCACG AGATGGGCTA CGACCGGCCG
TTGATCCAGC AGTACGGCAT GTACGTACAG CGCCTGCTGC AGGGTGATCT CGGCTACTCG
TACCAGCTCA ACCAGTCCGT TCTCGAGGCG ATCGAACAGC GGCTACCCAA GACGATGGTG
TTGTCGCTGC TGTCGACCCT GCTCGCCGTC GTGTTGGCGA TCCCGCTCGG CGTGCTCCAG
GCGGTACGCC GCAACCGATG GCCCGACTAC GCCATCACCG CGCTGTCGCT GCTGGCGTAC
GCCACGCCCA TCTTCTTTCT GGGCCTCATG ATGATCATTG TCTTCTCGCA GGTCTGGCCG
ATCCTGCCCC CGGAGGCACC GCAGGGGTTC ACGGTGGCCG AGGTGCTCGC CGATCCGGCC
GGGCTGGTCC TACCCACGGC CACCCTCGCC ATCGTCACCA TCGCGGTCTA CGCGCGGTAC
GTGCGGTCAT CCATGATCGA CAATCTGAAC GAGAACTACG TGCGGACCGC CCGCAGCAAG
GGGCTCTCCG AGCGGCGGGT TGTGCTGCGA CACACCCTGC GTAACGGGCT GTTCCCGGTC
ATCACGCTGC TCGGGATGTA CCTGCCCGCG CTGTTCAGTG GAGCGCTGGT CGTCGAGTCG
CTGTTCAATT TCCCCGGGAT GGGCCAGCTG TTCTGGCAGG CGGCCCTCAA GCGGGACTTT
CCGATCCTGC TCGGGGTCAC CGTCATCATC TCGATCGCCA CGGTCGTCGG CGCGCTGATA
GCCGACCTGC TCTACGCCAC CGTCGACCCC CGAGTCCGAC TCCGTGGGAG TGCCACATGA
 
Protein sequence
MARYLITRLG QALIVVVLVT VIAFLILHLL PGGAARATLG KEATLEQLAA FNHEMGYDRP 
LIQQYGMYVQ RLLQGDLGYS YQLNQSVLEA IEQRLPKTMV LSLLSTLLAV VLAIPLGVLQ
AVRRNRWPDY AITALSLLAY ATPIFFLGLM MIIVFSQVWP ILPPEAPQGF TVAEVLADPA
GLVLPTATLA IVTIAVYARY VRSSMIDNLN ENYVRTARSK GLSERRVVLR HTLRNGLFPV
ITLLGMYLPA LFSGALVVES LFNFPGMGQL FWQAALKRDF PILLGVTVII SIATVVGALI
ADLLYATVDP RVRLRGSAT