Gene Rcas_2162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2162 
Symbol 
ID5539642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2777336 
End bp2778613 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content62% 
IMG OID640894295 
Productvon Willebrand factor type A 
Protein accessionYP_001432264 
Protein GI156742135 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.314741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000328198 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACACCGG GCATTAACCT TCAGCAGACC CTGAGCCGGA CAACGCTGGC GGTCGGTGAC 
GAACCGCAAT TGATCTATGT GCTCCTGGAA GCGCACGCTG AAGGGTTGGC ACAGCAGTTG
CCCAAATTGC CGCTGAATCT GTGCCTGGTG CTCGATCGCA GTTCTTCGAT GCGCGGGGAG
CGCCTGATGC AGGTCAAGGA CGCCGCAGCA CGCATCGTCG ATCAATTGGG GCAGGACGAT
TATTTTTCGC TGGTGGTGTT CAACGACCGG GCTGATGTGG TTATTCCGGC GCAGCGCGCG
ATCAAGAAGG CGGACCTGAA AGCGGCGATT GCGCAGATTG AGGCGGCCGG CGGCACGGAA
ATGGCGCAGG GGATGGCGCT GGCGCTCCAG GAGGTGCAAC GACCGTTTCT GACACGCGGC
ATTAGCCGGA TCATTCTGTT GACCGATGGC CGCACCTACG GCGACGAGAG CCGGTGTGTC
GAGATCGCTC GCCGCGGGCA GTCGCGCGGC ATTGGGTTGA CGGCGCTCGG AATTGGAACG
GAATGGAACG AGGACCTGCT CGAAACGATG ACCGCCAGCG AAAACAGTCG TGCTCAGTAC
ATCGCCACTG CCCAGGATGT CGTCAAGGTC TTCGCCGATG AGGTGAAGCG CCTCCATGCC
ATCTTCGCCC AACAGGTGCA ACTGTCGGTC GAGACACGCC CCGGCGCGTT GTTGCGGTCG
CTCGATCAGG TGCGCCCTTT CATTGCGCCG ATTACCATTA TCGAAGAAGC AGAGCGCCGC
TGGGTGGCCA ATCTGGGAGA CTGGCCCGAT ACCGGCGTGC AGGGATTTCT GCTCGAAGTC
GTTGTGCCTC CCTTACCGGT TGGTGATCAC GCGGTGCTGA AACTGACGTT GCGCTATCAT
CTGCCTGGGG CAAACCTGCG CGATCAGGCG CGTGAACTCA TGGTTCGCGT TAGCCTGCGC
CCGGCGGAAG AGGTCACCCA TCGCGTCGAT GCAACCCTCA AACACTGGCT GGAGCGCCTG
GTGGCGTATC GCCTGCAAGC AAACGCCTGG AAGTGCGCGG CGGAAGGACG ACTCGAGGAA
GCGAGCGAGC GTCTGCAAAT GGCAGGAACG CGCATGCTCA ACGCTGGCGA CGCGGCGCTG
GCGCATACGT TGCAACAGGA AGCGACGCGC ATTCTGCGCA ACGGAACGGT GAGCGAAGAG
GGACGCAAGC GCATCCGCTT TGGCACTCGC GGTCTGATCG GTCCGGTTGC CGACGATGAA
CGCGAGACTA CGACGTGA
 
Protein sequence
MTPGINLQQT LSRTTLAVGD EPQLIYVLLE AHAEGLAQQL PKLPLNLCLV LDRSSSMRGE 
RLMQVKDAAA RIVDQLGQDD YFSLVVFNDR ADVVIPAQRA IKKADLKAAI AQIEAAGGTE
MAQGMALALQ EVQRPFLTRG ISRIILLTDG RTYGDESRCV EIARRGQSRG IGLTALGIGT
EWNEDLLETM TASENSRAQY IATAQDVVKV FADEVKRLHA IFAQQVQLSV ETRPGALLRS
LDQVRPFIAP ITIIEEAERR WVANLGDWPD TGVQGFLLEV VVPPLPVGDH AVLKLTLRYH
LPGANLRDQA RELMVRVSLR PAEEVTHRVD ATLKHWLERL VAYRLQANAW KCAAEGRLEE
ASERLQMAGT RMLNAGDAAL AHTLQQEATR ILRNGTVSEE GRKRIRFGTR GLIGPVADDE
RETTT