Gene Strop_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3800 
Symbol 
ID5060278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4355392 
End bp4356432 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID640476058 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001160609 
Protein GI145596312 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGC CCGCTAGCCA ACCGAAGAAG GTACGCGGCG AGCCGATCCT CGCCGTCAGA 
GACCTGGTCA AACACTTTCC CATCACCCGG GGCGTCATCT TCCAGCGCCA GGTCGGCGCG
GTCCGCGCCG TGGACGGGGT CAGCTTCGAC CTGCGCCGCG GTGAAACCCT CGGCATCGTG
GGCGAGTCCG GCTGCGGCAA GTCCACCCTG GCCCGGCTGC TGATGCGGCT GGAGATGCCA
ACCTCCGGAG CGGCGACCCT GGAGGGGCGG GACCTCTTCG CCGCCCGCGG CGCACAGCTG
CGCCGGCTAC GCCGCAACAT GCAGATGGTC CTCCAGGATC CGTACACCTC GCTGAACCCA
CGGATGACCG TCGGCGACAT CATCGGGGAA CCCTTCGAGA TCCACCCGGA GGCGGCACCC
AAGGGAAGCC GGCAGCGGCG GGTCCAGGAG CTGCTGGACA TGGTCGGGCT CAGCCCCGAG
CACATCAACC GGTACCCACA CCAGTTCTCC GGCGGCCAGC GGCAGCGCAT CGGCATCGCC
CGCGCGCTCG CGCTGCGCCC CGAGGTGCTC GTCTGTGACG AGCCGGTCTC CGCGCTGGAC
GTCTCGATCC AGGCGCAGGT GATCAATCTG CTCGAGCAGC TTCAGGACGA GCTCGGCCTC
TCGTACATCT TCATCGCCCA CGACCTGTCG GTGGTACGGC ACATCTGCGA CCGGGTCGCG
GTGATGTATC TGGGCCGGAT CGTGGAACTC GGCACCGAGG CCGAGATCTA CCAACGGGCC
ACCCACCCGT ACACCCAGGC GTTGCTGTCG GCGGTGCCGG TGCCCGACCC GGAGCAGCGG
GACAACCAGA ACATGATCCG CCTGATCGGC GACGTACCGA GCCCGGCGAA CCCGCCGAGC
GGCTGCCGGT TCCGCACCCG GTGCTGGAAG GCGCAGGACG TCTGTGCCAC CCAGGACCCG
GACACCGTGC CCCGCGCCGC CGACCCGCAC CCGTCGGCCT GCCACTTCGC CGAACTCCGC
GAACCGGCGT CCCCCTCCTG A
 
Protein sequence
MTAPASQPKK VRGEPILAVR DLVKHFPITR GVIFQRQVGA VRAVDGVSFD LRRGETLGIV 
GESGCGKSTL ARLLMRLEMP TSGAATLEGR DLFAARGAQL RRLRRNMQMV LQDPYTSLNP
RMTVGDIIGE PFEIHPEAAP KGSRQRRVQE LLDMVGLSPE HINRYPHQFS GGQRQRIGIA
RALALRPEVL VCDEPVSALD VSIQAQVINL LEQLQDELGL SYIFIAHDLS VVRHICDRVA
VMYLGRIVEL GTEAEIYQRA THPYTQALLS AVPVPDPEQR DNQNMIRLIG DVPSPANPPS
GCRFRTRCWK AQDVCATQDP DTVPRAADPH PSACHFAELR EPASPS