Gene Oter_1986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOter_1986 
Symbol 
ID6206846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOpitutus terrae PB90-1 
KingdomBacteria 
Replicon accessionNC_010571 
Strand
Start bp2542780 
End bp2545158 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content67% 
IMG OID641691636 
Productvon Willebrand factor type A 
Protein accessionYP_001818869 
Protein GI182413803 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.093276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAT TTCGCATCAC GACAGACGAC CCCCGGCTCA CCGCTTACGC GCTGGGCGAA 
CTCGAGGGAG AGGAACAGCA GCAGATCGAC GCGGCAGTGC GGGCGGATCC GGCGCTGCAG
GCGGAAGTGG CGGCGATTCG CGCGCTGGCC GGACACATGA ACGAAGCGCT CGCGGCGGAG
GCGTTGCCCG AGGTGGCGGA ACAGCCTTCG CGCTTGCAGA CGGCTGCGAT CATTCCCGGC
CCCCCAGATC AGCTCGATGG CGGACGTTCG AGTGGGGAGC AAGCGAAGCA GCTGGGCACG
CTGCTGCGGT TTTCGCGCTG GTATTTGGTC GGCGGCGGAT TGGCGGCGGC GTGCCTGGTG
CTGGTGTTGG CGCTGCACTC CGTGCCGCAC GACACTGGCG TCGCGGCGCC TGTCGCGAAA
CAACTTCAGG CCGCGGCCGA TAGCGCGCTG ACTGCAACGA CATCCTTGAC GGGACCGATG
GCGCAGAAGC GTCGGGTGGA GGAACCGATC AATCTGCCGA TTTCCCCGCA GCAGGCGCCA
GCGGATTTGG AGGCGTTCGC GGTGGTCGCA ACGCCGCCCG CGCCGCCGGT TGCCGGTCAG
CCGCTGCTGA AGCAAGTCGA GCGGCAGCCG AATCTGTCCA AAACGTCGAG TTTCGCTGCG
CCGGAATCGA CCGCCTTCGG TGCGCTTGCT CGCGCCGAAC TGCGGGAGAC GCAGCGGCAG
GTCCGTCAAG CTCGCGCGCA AAAAAAAGAT GCGGCGATGC AAGCGCTTCT GGTGGCGAAC
GAGGAGCCGG CGGCCTTGAG CTCGTTCCCG GGGCAGGCAC CCGCGATGGA CGGATATATT
GCGAGCACAA CCTTCGCCGG CATCGGCACG CGGGTGCGGG GGGATCATCG CCAGGCGATG
AACACCGAGG CGTATCGTTT TCTGCGCGAG AGCGACTTCC TCTCCGCCCG CGAGCATCCG
CTGTCGACGT TCGCAGCGGA TGTCGACACG GCGAGCTACG CGAATGTGCG GCGGTTCCTG
CGCGAAGGTC GGCTGCCGCC GGCGGACGCG GTGCGAATCG AGGAACTGGT GAACTATTTT
CCGTATCGCT ATGCGGCGCC GGGCAGAGTC CGCGACGAGG GCGTCGCGGC TCCAGGGGAG
GCCCCCTTCG CGGCGGCGTT GGAGGTGGCA GCGGCGCCAT GGGCGGCGCA GCACCGGCTG
GTGCGGATTG GGCTGAAGGC GAAAGACGCG GCCGTGAGCG GGCGCGCGGC GGCGAATTTG
GTGTTCCTGC TCGACGTGTC CGGTTCGATG GACCAGCCGA ACAAGCTGCG GCTCGTGCAG
GAATCGATGC GGCTGTTGCT CGGCCGGCTG CAGCCGGAGG ATCGCGTCGC GATCGTGACC
TACGCGGGAA ATAGCGGGCT GGCGCTGCCG TCGACGCCGG TGGCGCGGCA GCGCGAAATC
CTCGACGCGA TCGACGAACT GAGAGCGGGC GGCTCGACCA ACGGCGCGAT GGGGCTCCAA
CTCGCCTACG ACATCGCGAA GGCGAACTTC GTGGCGAACG GCGTGAACCG CGTGATCCTG
TGCACCGACG GCGATTTCAA TGTCGGCGTG ACCAGCGAAG GCGAACTGGT GCGGCTGATC
GAGGAGAAGG CGAAGTCCGG CGTGTTCCTG ACCGTACTCG GCTTCGGCAT GGGCAACCTC
AAGGATGCGA TGCTGCAGCA GATCGCCGAC CGCGGAAACG GGAGCTATGG CTACATCGAC
ACGCGGCGCG AAGCCGAGAA GCTGCTCGTG CAGCAGGTGA GCGGCACGCT CCTGACGGTG
GCGAAGGACG TGAAGCTGCA GGTGGAATTC AACCCGGCGA AGGTGGCGCG CTACCGGCTG
ATCGGCTACG AGAAGCGGCT GCTCAACCAG GAGGACTTCG CCAACGACAA GATTGACGCG
GGCGAGATCG GCGCGGGGCA CACGGTGACG GCGCTGTATG AAATCATCCC GGTTGGCGCG
AAGGACGCGG AGGTCACGGA AGAAACCGAG CCGGAGGATC GACGCTACAC CTACTCCAGC
GCGGCGCCGT CCGCCGTGGA GAAGCGCACG CTGGCGCACG CGGACGAGCT GTTGACACTC
AAAGTGCGCT ACAAGCAGCC GACGGCCCTG CTCAGCACCC GGCTCGAGTT TCCGCTGAAG
GACGACGGAG GGAACTTTGC CCAGGCGAGC GAAGATTTCC GGTTTGCGAG TGCGGTGGCC
GCGTTCGGGA TGATCTTGCG CGACTCACCG TACAAAGGTG TGGCGACGCT CGACGACGTG
ATCGCGTGGG CCAATGCGGC CACGAGTGAT GACCCCGGCG GCTACCGCGC GGAGTTCGTC
GAACTGGTGA AACAGGCGAG GCTGCTCACG CAGCGGTAG
 
Protein sequence
MNEFRITTDD PRLTAYALGE LEGEEQQQID AAVRADPALQ AEVAAIRALA GHMNEALAAE 
ALPEVAEQPS RLQTAAIIPG PPDQLDGGRS SGEQAKQLGT LLRFSRWYLV GGGLAAACLV
LVLALHSVPH DTGVAAPVAK QLQAAADSAL TATTSLTGPM AQKRRVEEPI NLPISPQQAP
ADLEAFAVVA TPPAPPVAGQ PLLKQVERQP NLSKTSSFAA PESTAFGALA RAELRETQRQ
VRQARAQKKD AAMQALLVAN EEPAALSSFP GQAPAMDGYI ASTTFAGIGT RVRGDHRQAM
NTEAYRFLRE SDFLSAREHP LSTFAADVDT ASYANVRRFL REGRLPPADA VRIEELVNYF
PYRYAAPGRV RDEGVAAPGE APFAAALEVA AAPWAAQHRL VRIGLKAKDA AVSGRAAANL
VFLLDVSGSM DQPNKLRLVQ ESMRLLLGRL QPEDRVAIVT YAGNSGLALP STPVARQREI
LDAIDELRAG GSTNGAMGLQ LAYDIAKANF VANGVNRVIL CTDGDFNVGV TSEGELVRLI
EEKAKSGVFL TVLGFGMGNL KDAMLQQIAD RGNGSYGYID TRREAEKLLV QQVSGTLLTV
AKDVKLQVEF NPAKVARYRL IGYEKRLLNQ EDFANDKIDA GEIGAGHTVT ALYEIIPVGA
KDAEVTEETE PEDRRYTYSS AAPSAVEKRT LAHADELLTL KVRYKQPTAL LSTRLEFPLK
DDGGNFAQAS EDFRFASAVA AFGMILRDSP YKGVATLDDV IAWANAATSD DPGGYRAEFV
ELVKQARLLT QR