Gene Strop_4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4165 
Symbol 
ID5060649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4735674 
End bp4736924 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID640476427 
Productmajor facilitator transporter 
Protein accessionYP_001160972 
Protein GI145596675 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0148559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTGG GTCGAAATTT CGCCAAGCTT TGGTGGTCGC AGGGCCTGTC GAATCTTGGT 
GATGGTCTGG TACTGGCCGC CGTGCCCCTC CTCGCCGTCA CGATGACCCG CGACCCGCTG
CTGGTCGCGG GGATGACCGT GGCGCAGTTC CTGCCATGGC TCCTCTTCAC TCTGCCAGCG
GGAGCTCTCG CCGACCGCAT CGATCGGCGC CTCATCATGG TGGCGGGAAA TCTGGTCCGC
GCGGTGGGAT TCGGACTGCT CGTCCTGACG CTCGTGGCCG ATGTACGCAG CATCGCAGTT
CTCTACATCG CGGTGTTCCT CGCCGGCACG GCAGAGACGC TCGTGGACAA CGCGGCACTC
ACGGTGCCGC CGCGCCTGGT GCAGCGCAGT GACCTCGAGC GGGCGAACGG ATGGCTCTTC
GCCACTCAGT CGGCCATCAA CAACTTTGTC GGACCCACCG CGGGTGCCGC TCTGTTCGCG
ATGTCGGCGG TCATGGTGTT CTCCTCCACA GCGGGATTCT TCGCACTCGC CGCGCTGGCA
GCCATCATGC TGCCACGGAT GATGCCGACC GCGAGCGACT CGAGCGGTGA CAAGCACACG
CCGGGCGAGG TTGTGCGCAG CATCAGGGAA GGCTGGTCGT ACTTCTGGAA CCACCGGCTG
ATGCGGCGGG TGGCCTTCAT CTCTGGTTCG ATCAATCTGT TCTCGTCCGC CACCGGCGGT
CTGCTCGTCC TCCTGGCGAC CGGTCCCTTG GGCGTGCCGG CGTCGTGGTA CGGCCTGTTC
ATCGCCGTCC CGGCCGTGGG AGCCGTGCTC GGCTCGCTGA TCGCCGCGCG AGTGGTGCCG
GCCATCGGCG GGGGACCGGT GACCTGGCTC GCCGCACTCG TGCCCGCCGC CAGCTATGTC
GTCCTCGGTC TCAGCGGCAG CATCGTCCTC TCCGAGGTGG CCATGTTCCT CGCCGCGGTC
GCCACGGCCT TGAACCAGAT CGTGGTCAGC ACGCTCCGGC AGGCTGCCGT CCCCGACGGG
GTCCTCGGCC GGGTAACCGC CGGCTACCGT CTGATCGTGC TCGGCGCGGT GCCGGTCGGG
GCGCTCCTCG GCGGCGGACT GGGGCGTTGG CTGGGCCCGG AGACCACCTT CGTCGTCTGC
GGCGTCGGAC TGACCGTAGC GGCGATCGTT TTCGCCTCCC GGGTCACGAC CCGTGCCCTG
CGCGAGGCGG AGGAGGTAGC CCGGACTGCG GCCCGTGAGC CAGCCGCCTG A
 
Protein sequence
MTLGRNFAKL WWSQGLSNLG DGLVLAAVPL LAVTMTRDPL LVAGMTVAQF LPWLLFTLPA 
GALADRIDRR LIMVAGNLVR AVGFGLLVLT LVADVRSIAV LYIAVFLAGT AETLVDNAAL
TVPPRLVQRS DLERANGWLF ATQSAINNFV GPTAGAALFA MSAVMVFSST AGFFALAALA
AIMLPRMMPT ASDSSGDKHT PGEVVRSIRE GWSYFWNHRL MRRVAFISGS INLFSSATGG
LLVLLATGPL GVPASWYGLF IAVPAVGAVL GSLIAARVVP AIGGGPVTWL AALVPAASYV
VLGLSGSIVL SEVAMFLAAV ATALNQIVVS TLRQAAVPDG VLGRVTAGYR LIVLGAVPVG
ALLGGGLGRW LGPETTFVVC GVGLTVAAIV FASRVTTRAL REAEEVARTA AREPAA