Gene Strop_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3801 
Symbol 
ID5060279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4356581 
End bp4357774 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content71% 
IMG OID640476059 
Producthypothetical protein 
Protein accessionYP_001160610 
Protein GI145596313 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGA CGACCTTCGG TGCACGCCTA CGAGCCCACC GCGAACGTGC CGGGAAGACT 
CGACCGATCC TCGGCGGTCT GGTCGGTCGC AGCGCGGAGT GGGTCAAGGC ATTGGAGAAT
GGACGCCTAC TTCCGCCCCG CCTTCCCATG ATGCTTCGCC TGGCTGAGGT GTTGGGGCTC
ACCGACCTCG CCGACCTGAC CGGAGATCAG ACCCTGCCCG TGGCGTCGGT GACCCGCGCT
GGTCATCCCG CTCTCGACAA GGTCGCCGCC GCGATACAAC GCTCTCCCCT GCCGGCCGGT
CCGGCTACTC GGGTCGAGGT GCTGCGCGGC TTGGTCGGTC AGGCGTGGGA GTTGTGGCAC
CGTTCAACCA CCGAACGGAC GGCGGTCGCC GGGGTACTTC CCGGGCTGCT GGCGGAGGCG
CAGCGCAGCG CTCGCCGGCT GGACGGCCTG GCCCGCCGAC AGGCGCTCGT CGAACTGGCC
CGGGTCTACC ACCTCACCCA GTTGTACCTG GCCCACCAGC CGTACCCAGA GTTGGTGTGG
CTGGCCGCTG ACCGGGCGAT GTCTGCCGCG CAGGACGCCG ATGACCCGGC CGCGATCGCG
GCGGCCGGCT GGTACTACGC CCACGTCTAC CGCGGAACCA ACCAGATCGA CGCGGCCGAG
CAGGTGCTTG TCGACGCTGC CACTCTGGTG GACCCGGCAG CCGACGATGA GCAGTTGGCC
CGGTGGGGGC AGGCCCAGCT CGGCCTCGCG CTGGGGCACA GCAAGGCTGG TCGAGCCGGG
CAGGCATGGC GGGCATGGGA CGCCGCCGAC GAGGCCGCGG GCCGGCTTGG CAGCACGTAT
ACCCACCCGT GGTTGATGTT CGGGCCGGCG GCCTGCGCGA CGTACGCGGT CACCATCGAA
ACGGATCTGT GCCGGCCGGG CGCGGCGGTT CGGCGTGCGG GCACCATGGA CTACCGTGCG
CTTCCGTCGT GTACGCGTCG GGCCGCCGCC CTGATCGAGG GCGCACGGGC GCATGTGCTG
CATCGAGGTG AGTTGGCGGC GGTGCATCTG CTCGGACGGG CGCTGCGGGA AAGCGTGGAC
ACCATGCGAC ACCACCCGTA CGCCCGCACC GCTGCCCTGG AGCTGTCCAG CCGGCCCGGC
GTGGTCGGGG AGGACGCGAG GGAACTTGCC CTCGCGATCG GCGTAATGGG GTAG
 
Protein sequence
MDQTTFGARL RAHRERAGKT RPILGGLVGR SAEWVKALEN GRLLPPRLPM MLRLAEVLGL 
TDLADLTGDQ TLPVASVTRA GHPALDKVAA AIQRSPLPAG PATRVEVLRG LVGQAWELWH
RSTTERTAVA GVLPGLLAEA QRSARRLDGL ARRQALVELA RVYHLTQLYL AHQPYPELVW
LAADRAMSAA QDADDPAAIA AAGWYYAHVY RGTNQIDAAE QVLVDAATLV DPAADDEQLA
RWGQAQLGLA LGHSKAGRAG QAWRAWDAAD EAAGRLGSTY THPWLMFGPA ACATYAVTIE
TDLCRPGAAV RRAGTMDYRA LPSCTRRAAA LIEGARAHVL HRGELAAVHL LGRALRESVD
TMRHHPYART AALELSSRPG VVGEDARELA LAIGVMG