Gene Strop_4497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4497 
Symbol 
ID5060987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp5087122 
End bp5088762 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content72% 
IMG OID640476764 
ProductNa+/solute symporter 
Protein accessionYP_001161303 
Protein GI145597006 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.108837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.145875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCAACG GCTACGTCGT TCCGGCGATC GTCGCAGTCA CCCTGCTCAC CATCGGGATC 
GGCTTCTACG GTCTGCGGCT CGCCCGCACC ACCTCGGACT TTCTGGTGGC ATCCCGGGCC
GTCAGCCCGA CCTGGAACGC CGCCGCGATC GGCGGGGAGT ACCTGTCGGC CGCGAGCTTC
CTCGGCGTCG CCGGTCTGAT CCTCAAATTC GGGGTCGACA TGCTCTGGTA CCCGGTCGGC
TTCGCCGCCG GCTACCTGGC TCTGCTGCTC TTCGTCGCCG CGCCGCTACG CCGCTCCGGG
GCGTTCACCC TCCCCGACTT CTGTGAGCTG CGGCTGGGCT CCCGGCGGCT CCGCACCCTC
GCCACCATCT TCGTGATCTT CATTGGTTGG CTCTACCTCG TCCCGCAGCT ACACGGAGCC
GGGCTGACCC TGGCCACCCT GACCGGCTCC CCCTACCCGG CGGGCGCCCT CCTGGTCGCG
GCGGTGGTCA CCGCGAACGT GGCGCTCGGC GGCATGCGTG CGATCACCTT CGTCCAGGCG
TTCCAGTACT GGCTCAAGCT GACCGCCCTC GCCGTACCGG CGATCTTCCT GGTGTTGCAG
TGGCAGGCCG ACGCCCGCCC GGCCATCACC CCACCCGAGG GTCCGACCTT CCGCACCGCG
ACCACCGTCG TGGTCGAACA TCCCGCCACC CTCACCCTGC CCGACGGTCA GCTGGAGGAG
GTACGCCCCG GCGACGAGCT AACCTTTGCG GCCGGGCAAC CGGTGCCCGC GGTGATCGGG
ACCGCCACCG ACGCGGCCGA CTGGTTGCTG CCCAGCACCG CCGGGACCGA CGACCGGGGC
CTGTTCACCA CCTACTCGCT GATCCTCGCC ACGTTCCTCG GCACCATGGG CCTGCCGCAC
GTGCTGGTGC GCTTCTACAC CAACCCGGAC GGCGCCGCCG CCCGCCGCAC CACCCTGGTG
GTGTTGGCCC TGGTCGGCGC CTTCTATCTG CTGCCGACCC TCTACGGGGC ACTCGGCCGG
ATCTACACCC CACACCTGCT GCTCACCGGG GAGACCGACG CGGTGGTGCT GCTCCTGCCC
AACGCGGCGC TGGGTGACGG CACCACCGGC CGGCTCCTCG CGGCACTGGT CGCCGCCGGG
GCCTTCGCGG CATTCCTCTC CACCTCCTCC GGCCTGCTCA CCAGCGTTGC CGGAGTGGTC
TCCACAGACG TGCTGCGACG CGGCTCGGTA CGCGGGTTCC GACTCGCCAC CGTGCTCGCC
GCCGGCGCGC CCACGGTGCT CGCGCTCAAC GTCTCCGGGC TGGAGGTGTC ACAGGTGGTG
GGGCTGGCGT TCGCGGTGGC CGCGTCGAGC TTCTGCCCCC TGTTGGTGCT GGGCATCTGG
TGGCGGGGAC TGACCGACCG CGGCGCCGCC GCCGGAGTGC TTGTCGGCGG CGGCGCCGCG
GTCGGGGCGG TGTTGGTGAC CGTGCTCGGC CCACCGCTGA CCGGGTGGCC GGCCACGCTC
GTCGCGCAGC CAGCCGCCTG GACGGTCCCG CTCGCCTTCA CCGTCATGGT GGCGGTGTCG
ATCGCCACCC ACCGCCGCGT CCCGACTGAT GTCGGCGTCA CGATGCTCCG CCTGCACGCA
CCCGAAGCCC TCCGGCCGTA G
 
Protein sequence
MGNGYVVPAI VAVTLLTIGI GFYGLRLART TSDFLVASRA VSPTWNAAAI GGEYLSAASF 
LGVAGLILKF GVDMLWYPVG FAAGYLALLL FVAAPLRRSG AFTLPDFCEL RLGSRRLRTL
ATIFVIFIGW LYLVPQLHGA GLTLATLTGS PYPAGALLVA AVVTANVALG GMRAITFVQA
FQYWLKLTAL AVPAIFLVLQ WQADARPAIT PPEGPTFRTA TTVVVEHPAT LTLPDGQLEE
VRPGDELTFA AGQPVPAVIG TATDAADWLL PSTAGTDDRG LFTTYSLILA TFLGTMGLPH
VLVRFYTNPD GAAARRTTLV VLALVGAFYL LPTLYGALGR IYTPHLLLTG ETDAVVLLLP
NAALGDGTTG RLLAALVAAG AFAAFLSTSS GLLTSVAGVV STDVLRRGSV RGFRLATVLA
AGAPTVLALN VSGLEVSQVV GLAFAVAASS FCPLLVLGIW WRGLTDRGAA AGVLVGGGAA
VGAVLVTVLG PPLTGWPATL VAQPAAWTVP LAFTVMVAVS IATHRRVPTD VGVTMLRLHA
PEALRP