Gene Strop_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3594 
Symbol 
ID5060069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4110526 
End bp4112256 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content69% 
IMG OID640475849 
Productvon Willebrand factor, type A 
Protein accessionYP_001160403 
Protein GI145596106 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTCCAG GCCGCCATCG CACCCGAACG AACATCCGCA CCGCCGGTGT CGCCGCAGCT 
GTCGGGGTAC TGGTCATTGC TGCTGGTGGA TACTTCGGCT ACCGGCAGCT AGCCTCGCCG
GGGTGCTCCG GCCAGGTTGA GCTCGCGGTC GCGGTCGCGA CTGAGCTGGC ACCGGCAGTC
GACGCCGCGG CGACCGAGTG GGCGAACGAG GGCGCGGTGG TGGATGGCAG CTGCGTTGAG
GTAAGCGTGA CGGCTGCCGA GCCGGTCGAG ATAGCCGCCA CCGTCGCGGC CAAACACGGT
GCCATCCTGG CCGGGGTGGG GCAGGTCAAC GGCGCCGCGG TCAGCCCGGA TGTCTGGGTG
CCCGACTCGT CGGCGTGGCT GCTACGGCTT CGGAGCGGCG GTGCGACCGC ATTCGATCCG
GGTAACGGAG CGTCAATCGC CCGCAGCCCG GTGGTCCTGG GGGTGCCCGA GCCGATCGCC
TCCCAGCTCG GCTGGCCGGC GCAGGAACTC ACCTGGTCCG CGCTGGTCGG CCAGGTTAAC
AGCGCTAAGC CGCTCAAGGC CGGCACCGTG AACCCGACCC GAGATGCCGC CGGTCTCTCC
GGGCTACTCG CGCTGAGCGC CGCCGCGGCG GCCGGGCAGG ACGGCCAGGC GGCAACCGTC
GGCGCGTTGC GGGCCTTGTC CACCAGCAGC GCGAATCTGC GCCAGGAACT GCTCTCGAAG
TTCCCCACCG CCGCAGACTC CACCACGCTG GCCCGGAGTC TCGGCGCGGC GGCGTTGTCT
GAGGAGGATC TGCTCTCGTA CAACGCCCGG AAGCCGGCGG TGCCGCTGGC CGCGCTCTAC
CCGGAGCCAG CGGCGAACCC GTTGGACTAC CCGTACGCGG TGCTGCCGGG GATCGGGCCG
GCCAAGGCGT CGGCTGCCCA GATGCTTTTC GACGTGCTCA CCACGGCCAG CTTCAAGGAT
CGGTTGGCGT TGTCGTCACT ACGAGCGCCG GACGGTACCT GGGGTGCTGG TTTCAGCGCG
CCCGCAGGGG CGCCGAGCCC GGCGGCCGAT GGTGGCAACG CCGCTGGTGA CCTGGACCCA
CTGGCGGTCG AGCGAGCGGT CTCCAGCTGG TCGATCGCCA CCCAGTCCGG CCGGATGCTC
TGTGTCATCG ATGTCTCTGG CTCGATGCGG GAACCCGTGG CGAGCGCCAA CGGTGTGAGC
CGCCAGCAGG TCACCCTGGA TGCCGCGGGG CGGGGGCTCC ACCTCTTCGA TGACAGCTGG
CAGATCGGGC TCTGGGAGTT CTCGACCAAC CTGGGCAGCG GGCGGGACTA CCGGCGGCTG
GTCGAGATCG GCCCGCTGAG TAGTCAGCGG AGCGAGCTTG AGCAGGCGTT GGCCCAGATT
CAGCCGACCC GGGGTGACAC TGGTCTGTTC GACACGGTGC TCGCCGCGTA CGAGGCAGTC
CAGGAGGACT GGGACGAGGG CCAGGTCAAT TCGATCGTGC TCTTCACCGA CGGCAAGAAT
GACGATGACA ACGGCATCAG CCAGCAGCAG CTGATCGCCG AACTGGAACG GATCAAGGAC
CCGGAGCGGC CGGTGCAGGT CGTTCTGATC GGGATCGGCG CGGACGTCAG CAAGGCAGAG
CTGGAGTCGA TCACGGAGGT TACCGGTGGT GGCTCCTTCA TCACCGAGGA CCCGACCAAG
ATTGGTGACA TCTTCCTGAA GGCCATCGCA CTGCGCGAGC CGGATGCCTG A
 
Protein sequence
MSPGRHRTRT NIRTAGVAAA VGVLVIAAGG YFGYRQLASP GCSGQVELAV AVATELAPAV 
DAAATEWANE GAVVDGSCVE VSVTAAEPVE IAATVAAKHG AILAGVGQVN GAAVSPDVWV
PDSSAWLLRL RSGGATAFDP GNGASIARSP VVLGVPEPIA SQLGWPAQEL TWSALVGQVN
SAKPLKAGTV NPTRDAAGLS GLLALSAAAA AGQDGQAATV GALRALSTSS ANLRQELLSK
FPTAADSTTL ARSLGAAALS EEDLLSYNAR KPAVPLAALY PEPAANPLDY PYAVLPGIGP
AKASAAQMLF DVLTTASFKD RLALSSLRAP DGTWGAGFSA PAGAPSPAAD GGNAAGDLDP
LAVERAVSSW SIATQSGRML CVIDVSGSMR EPVASANGVS RQQVTLDAAG RGLHLFDDSW
QIGLWEFSTN LGSGRDYRRL VEIGPLSSQR SELEQALAQI QPTRGDTGLF DTVLAAYEAV
QEDWDEGQVN SIVLFTDGKN DDDNGISQQQ LIAELERIKD PERPVQVVLI GIGADVSKAE
LESITEVTGG GSFITEDPTK IGDIFLKAIA LREPDA