Gene Rcas_3275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3275 
Symbol 
ID5540773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4251750 
End bp4253009 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content59% 
IMG OID640895393 
Productvon Willebrand factor type A 
Protein accessionYP_001433344 
Protein GI156743215 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.112036 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGCG AGGTTGCCAT TCGCGCATCG CTGGCGCGTC CGTATCTGAC GGCAGCGACG 
ATGCCGCAGG TTGCGTATCT GCTGATCGAA GTCACGCCTG GTCAGATCAT GACACAAGTG
CGAGCGCCGG TCAATGTCTG TTTTGTCATT GATCGGAGCG GCTCGATGAA GGGCGAAAAG
ATCGACCGGG TGCGACGCGC GACGATTCGC GCAATTGAGA TGCTCGACGC ACAGGATGTC
GTCTCGGTCG TGATCTTCGA TCATCGCACC GAGGTCCTGA TCCCTGCCAC GCCGGTCGCC
AAACCCGCAG AACTGGCTGA TCGCGTCAAT CGTGTGCGCG ATAGCGGCGG AACCCGGATT
GCGCCGGCTA TCGAGGCAGG TCTGCGCGAG ATCGATAAAG GACCGTCGCA CATGGTGCGT
CGCCTCATCC TGCTCACTGA CGGTCAGACC GAGAGCGAGT CCGACTGTCT GCGACGCGCC
GAGGATGCCG GACGGCGCAA CGTGCCGATC ACGGCGCTTG GCGTCGGCAA GGACTGGAAC
GAGGATCTGT TGATCGAGAT GGCGAATCGT TCGGGAGGAA CGGCAGACTA TATTGATCGT
CCAGAAAAGA TCGTCGATTA CTTCCAGAAT ACCATCCAGC GCGCGCAGGC GACGACGGTG
CAGAATGCGA ACGTGACGCT ACGATTTGTG CAGGGAGTAT TGCCGCGCGC CGTGTGGCAG
GTCTACCCGC TTATCACCAA CCTCGGTTAC CGCCCCATTT CTGATCGCGA CGTCAGTGTG
CCGCTTGGTG AACTGGAAAC CGGGAGCGGA CGCACCCTGC TTGTCGAAGT GCTGGTCGAG
CCGCGACCAT CCGGTGAGTA TCGCATCGCT CAGGTCGAGG TAAGTTATGA TATTCCGCTG
CTGAATCTGC ACGGTGAGAA GAGTCGCGCC GACATCATGC TTTCCTTTAC GACTGATGCC
GGGCTTGCTG CGCAGGTGAA TCCGAATGTG ATGAATATCG TCGAGAAAGT CAGCGCCTTC
AAGTTGCAGA CGCGCGCCTT GCAGGACCTC GCTGCCGGCG ATGTCGCGGG AGCGACCCAG
AAGCTGCAAA GCGCCGTGAC CCGGTTGCTC AACCAGGGCG AAGTCGAACT TGCGCAGACG
ATGGAGCGTG AGATTCAGCA TCTGCAACAG ACCGGCAAAC TTTCCAGCGA AGGGCAGAAG
ACGATCAAGT TCGGCGTGCA GAAGACGGTG CGGTTGAGCG ACATCAAGCA GGAGGAATAG
 
Protein sequence
MAGEVAIRAS LARPYLTAAT MPQVAYLLIE VTPGQIMTQV RAPVNVCFVI DRSGSMKGEK 
IDRVRRATIR AIEMLDAQDV VSVVIFDHRT EVLIPATPVA KPAELADRVN RVRDSGGTRI
APAIEAGLRE IDKGPSHMVR RLILLTDGQT ESESDCLRRA EDAGRRNVPI TALGVGKDWN
EDLLIEMANR SGGTADYIDR PEKIVDYFQN TIQRAQATTV QNANVTLRFV QGVLPRAVWQ
VYPLITNLGY RPISDRDVSV PLGELETGSG RTLLVEVLVE PRPSGEYRIA QVEVSYDIPL
LNLHGEKSRA DIMLSFTTDA GLAAQVNPNV MNIVEKVSAF KLQTRALQDL AAGDVAGATQ
KLQSAVTRLL NQGEVELAQT MEREIQHLQQ TGKLSSEGQK TIKFGVQKTV RLSDIKQEE