Gene Strop_2073 details

Gene Information

Locus tag: Strop_2073
Symbol:
ID: 5058536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2345879
End bp: 2347189
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 640474336
Product: von Willebrand factor, type A
Protein accession: YP_001158902
Protein GI: 145594605
COG category:
COG ID:
TIGRFAM ID:
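
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record, but both are consistent with its values. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2345879, 2347189)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```

Both results agree with the Gene Length and Protein Length fields of the record.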


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.04125
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.439198
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAGCT GTCCTAGACT GCCGTCGATG ATCAAGACGA GACGATTGGC GGCAGCCCTC 
GTCGGGCTGC TGGCAGCGAG CGTGATGACT GGTCCGGTGC CGGCCCTCGC GGACTGGGAG
ACCCCGGTCG AGCCGCCGAA GGTCGAGCTG GTCCTCGACG TCAGCGGATC GATGCGGGCC
ACCGACATCG ACGGGCGGAG CCGAATCTCG GTCGCCCAGC AGGCGTTCAA CGAGGTGGTG
GACGCGCTGC CGGATGAGAC TGAACTGGGA ATCCGGGTCC TCGGTGCCAC CTATCCGGGT
GACGACAAGG AGCAGGGCTG CCAGGACACC CAACAGATCG TGCCGGTCGG ACCGGTCGAT
CGGGTGCAGG CAAAGGCAGC GGTGGCGACG CTTCGTCCGA CGGGTTACAC GCCGGTCGGA
CTGGCGCTGC GCTCGGCCGC CGAGGATCTC GGTACGGGTA GCACCGCCCG GCGGATCGTG
CTGATTACCG ACGGCGAGGA CACCTGCGCC CCACCAGACC CCTGTGAGGT GGCCCGAGAG
CTGGCTGCGC AGGGGACGAA GCTGGTCGTG GACACCCTCG GCCTGGCCCC GGACGAGAAG
GTGCGTCAGC AACTGCTCTG CATCGCCGGG GCCACTGGTG GCACGTACAC CGCGGCGCAG
AGCGCGGACG AACTGACCGG GCGGATCAAG CAACTGGTCG ACCGGGCCCG GGACACGCAC
ACGGCCACGC CGGCCGTGGT CGCCGGTACC TCGGTCTGTG CCGACGCCCC GCTACTCGGC
GCCGGCGTCT ACAGCGACCG GGAGAAGTTC TCGGAGCACC GCTGGTATCG GGTGCCGGTG
TATCCCGGGC AGGAGCTGCG CGCCTCTGTC AGTGTGGCGT TGGACCGGCC GGTCAACCCC
GACCATGCGG TGCTGCTGCG GGCGGTGGCC ACCGACGGTC GGGAACTGGT GCGTGGCGTG
GACGCCGGTA GCGGCCGGAC CGATGTCGTC TCCGCCGGTC TGCGTTGGTC GGCGGGGGAG
CAGCCGGAGG ATGGGCCCTC CCCAACCCCG TCGACTACCA CCGACGCCGA AGCCACCATC
GTCTGTCTCG TGGTGAGTAA CGCCTTCGCA CCCCAGCCGG GGACCCAGAT GTCGCCGGGT
ATGCCGGTTG AGTTGACCGT GGACATGGTC GTGTCCTCGC CTGCTCCGGC TGCCCCGGAT
CTCGGTCGTG GCTGGGTGCT GCTCGTCCTG CTGACCGGGG TTGGTCTGCT GGCAGGACTG
GCGTCCGGGG TGCTCACCCG GTGGTGGGTA ACGACCTGGA GGGAGAAGTG A
 
Protein sequence
MWSCPRLPSM IKTRRLAAAL VGLLAASVMT GPVPALADWE TPVEPPKVEL VLDVSGSMRA 
TDIDGRSRIS VAQQAFNEVV DALPDETELG IRVLGATYPG DDKEQGCQDT QQIVPVGPVD
RVQAKAAVAT LRPTGYTPVG LALRSAAEDL GTGSTARRIV LITDGEDTCA PPDPCEVARE
LAAQGTKLVV DTLGLAPDEK VRQQLLCIAG ATGGTYTAAQ SADELTGRIK QLVDRARDTH
TATPAVVAGT SVCADAPLLG AGVYSDREKF SEHRWYRVPV YPGQELRASV SVALDRPVNP
DHAVLLRAVA TDGRELVRGV DAGSGRTDVV SAGLRWSAGE QPEDGPSPTP STTTDAEATI
VCLVVSNAFA PQPGTQMSPG MPVELTVDMV VSSPAPAAPD LGRGWVLLVL LTGVGLLAGL
ASGVLTRWWV TTWREK
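
The sequences above are printed in space-separated groups of ten, so they need to be cleaned before any downstream use. The sketch below is one possible way to do that in plain Python, and to translate the gene sequence and compute a GC fraction from it. The helper names are hypothetical. The codon table is the standard one in NCBI order; translation table 11 uses the same amino-acid assignments and differs only in the permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGTGGAGCT GTCCTAGACT GCCGTCGATG ATCAAGACGA GACGATTGGC GGCAGCCCTC"
dna = clean_sequence(first_row)
print(len(dna))        # 60
print(translate(dna))  # MWSCPRLPSMIKTRRLAAAL
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` would give a value to compare with the GC content field.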