Gene Sare_4212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4212 
Symbol 
ID5707950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4780716 
End bp4781726 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID641273631 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001538984 
Protein GI159039731 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0203123 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCGCT ACGTCATCCG ACGGTTGCTC CAGTTCATCC CGACCGTGCT GGGCACCATG 
TTCCTGCTCC ACTACATGAC CTCGCTGGCG ATCCAGTTCA GCGGGAACCC GGTCCTGGCG
CTCTTCGGCG ACCGGACCCC GCCGCAGTCT GTGCTTGACG CGATCACCGA GCGGCTCGGT
TACTCCGACC CCTGCCTCGA CCAGAGGGGC AATCCCTGCC TCGGGCTCTT CGCCGATCGG
GTGAGCAATA TTTTCCTTCA CTTTGACTTC GGCATCAACC TCAATCGGGA AGAGGTCACC
GACATGGTGG CCAACGCCCT CCCGTTCACC CTGAAGCTGT TGGTGATCGC GATCGTCTTC
GAGGCGGTCG TCGGTATCGC GGCCGGGGTG TGGGCGGGTC TGCGGGGCGG CAGCTTCGCC
GACAACCTGG TGAAGATCAG CACCGTTTTC GTGATCTCTG TGCCGATCTT TGTGCTCGGC
GTCGTGGTGC GGGAGTTCGT CGGGGTCAAG TTCGGCAACA TTCTGCGTGA TCAGGAGTGG
ATTCCGGACG TTATCGCGAC GGGCGTCTTC AGTCCCGGCT TCAAGCCGGA CTACCCCTTG
GCCAGCCTGT TGATCCCGGG CATGGTTTTG GGCGCGGTCG CGCTCGCCAC CACCGCGCGC
CTGACCCGAA CCAGCATCAT GGAGAACATC CGGGCCGACT ACGTCCGGAC CGCTCGGGCC
AAGGGGCTGG CGAACAAGCG GGTCATTGGC GTGCACACGC TGCGTAACTC GTTGATCCCG
GTGATCACGT ACCTCGGTGT CGACATCGGC TCCGCCATGG CCGGCGCGGT GGTCACCGAG
ACCATCTTCA ACGTGCCTGG TATCGGACGG ATGGTGACGC ACGCCGCCCG TAGCGGTGAG
GCGGCCGTGG TCATCGGTGT GGTCACCATG CTGGTGCTGG TCGTTCTGGT CGCCAACCTG
CTGGTCGACC TCCTCTACGC CGTGCTCGAC CCAAGGATTC GCTATGAGTG A
 
Protein sequence
MGRYVIRRLL QFIPTVLGTM FLLHYMTSLA IQFSGNPVLA LFGDRTPPQS VLDAITERLG 
YSDPCLDQRG NPCLGLFADR VSNIFLHFDF GINLNREEVT DMVANALPFT LKLLVIAIVF
EAVVGIAAGV WAGLRGGSFA DNLVKISTVF VISVPIFVLG VVVREFVGVK FGNILRDQEW
IPDVIATGVF SPGFKPDYPL ASLLIPGMVL GAVALATTAR LTRTSIMENI RADYVRTARA
KGLANKRVIG VHTLRNSLIP VITYLGVDIG SAMAGAVVTE TIFNVPGIGR MVTHAARSGE
AAVVIGVVTM LVLVVLVANL LVDLLYAVLD PRIRYE