Gene Strop_2018 details


Gene Information

Locus tag: Strop_2018
Symbol:
ID: 5058481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2289032
End bp: 2290375
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 68%
IMG OID: 640474283
Product: VWA containing CoxE family protein
Protein accession: YP_001158849
Protein GI: 145594552
COG category:
COG ID:
TIGRFAM ID:
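
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a stop codon (consistent with "Is gene spliced: No"). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2289032
END_BP = 2290375
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1344
    print(expected_protein_length(GENE_LENGTH_BP))  # 447
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length reported above.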


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.531693
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.264381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCG ACCATCACGA TCGTGCCGAC GAGAATCGGC GGCAGGTGCT CTACTGGCGG 
TTACTGGCCC GGCTCTTCGA CCCGACCGAG CAGCCTACGC TGGAAGCGGC GAGCCTGGCC
GTGGTCGAGG ACCTGGGGCT GCCCGCCGCG CTGCTGGACC CGGCCGTCTC GGTGGACACT
GTGGTCCAGC GCTACCCGTC GCTCGCCGCG GAGCTGCGCG GCCTGCTCAC CCCTGCCGAG
CACGCACCAG TGGCCGACGG CACCGCCACA CCGGAACCCG GCCCAGCCGA GGTGCGGCGG
GCCGCGCTGG TGTCGAAGGT GCTGCTCAAC GTGTTCGCCG CCGGCACCGG GCCGGTCAGC
GCCAGCGAGC TGGCCCGGTG GCAGTCCGAC GCCGGCTGGT TTGAGCAGGC GCTCGGTGCC
GAGCCGGGCG AGCTGCGCCG GCAGGGCGGG ACGGGGCTGG GTGGTGCTCT CGCCGCGCTG
GAGGGCGACC TGGTACGCCG GATGCATCTG CGCGAGGTGC TCGCCGACCC GGCGCTGGCC
AGCCGGCTCA CGCCGAGCAT GTCGCTGATC GAGCAGTTGT TGCGGGACAA GGCGAACCTG
TCCGGGGTAG CGCTGGCCAA CGCCAAGGCG CTGATCCGCC GGTTTGTCGA CGAGGTTGCC
GAGGTGCTGC GCACGCAGGT GGAGCAGACC AGCGTCGGAA CCATCGACAG GTCGGTGCCG
CCGAAGCGGG TGTTCCGTAA CCTCGACCTG GATCGCACCA TCTGGCAGAA CCTGACCAAC
TGGAGCCCCG AGGACCAGCG GCTCTACGTC GATCGCCTCT ACTACCGGCG TACGGCCCGC
CGTACGACAC CGGCTCGGCT GATCGTGGTG GTGGACCAGT CAGGCTCGAT GGTCGACTCG
ATGGTCAACT GTACGATCCT GGCGTCGATC TTCGCCGGGC TACCCAAGGT GGATGTGCAT
CTGATCGCGT ACGACACACA GGCGTTGGAC CTGACCCCGT GGGTGCGCGA CCCGTTCGAG
GTTCTGCTGC GCACCAAACT GGGTGGCGGC AACGACGGTC CGGTCGCCAT GGCGATGGCC
CGGCCCAAGA TCGCCGAACC GCGGAACACC GTCATGGTGT GGATTTCGGA CTTCTACGAG
TTCGACCGAT CGCAGCCCTT GTTCGACGGC ATCGAAGCCG TGCACCGTTC CGGCGTGCGA
TTCATCCCGG TCGGCTCAGT CAACAGCTCC GGGCAGCAGA GCGTCAATCC CTGGTTCCGG
CAACGCTTCA AGGACCTGGG CACCCCCGTG ATCTCGGGTC ACATCCGCAA ACTCGTCTTC
GAGCTCAAGA GCTTTCTCAC CTAG
 
Protein sequence
MTLDHHDRAD ENRRQVLYWR LLARLFDPTE QPTLEAASLA VVEDLGLPAA LLDPAVSVDT 
VVQRYPSLAA ELRGLLTPAE HAPVADGTAT PEPGPAEVRR AALVSKVLLN VFAAGTGPVS
ASELARWQSD AGWFEQALGA EPGELRRQGG TGLGGALAAL EGDLVRRMHL REVLADPALA
SRLTPSMSLI EQLLRDKANL SGVALANAKA LIRRFVDEVA EVLRTQVEQT SVGTIDRSVP
PKRVFRNLDL DRTIWQNLTN WSPEDQRLYV DRLYYRRTAR RTTPARLIVV VDQSGSMVDS
MVNCTILASI FAGLPKVDVH LIAYDTQALD LTPWVRDPFE VLLRTKLGGG NDGPVAMAMA
RPKIAEPRNT VMVWISDFYE FDRSQPLFDG IEAVHRSGVR FIPVGSVNSS GQQSVNPWFR
QRFKDLGTPV ISGHIRKLVF ELKSFLT
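
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes a GC fraction, and translates the gene sequence. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments for elongation; alternative start codons are not handled, which is harmless here because this gene begins with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    first_blocks = clean_sequence("ATGACCCTCG ACCATCACGA TCGTGCCGAC")
    print(translate(first_blocks))  # MTLDHHDRAD
```

Applying `clean_sequence` to the full gene sequence and passing the result to `translate` and `gc_fraction` gives values that can be compared against the protein sequence and the GC content field in this record.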