Gene Rcas_2394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2394 
Symbol 
ID5539875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3084284 
End bp3085522 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content61% 
IMG OID640894526 
Productvon Willebrand factor type A 
Protein accessionYP_001432494 
Protein GI156742365 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACAG GGGTCACGCT CACGTGTACA TGGGGACGCG CGCCACTGGT TGCCAGTGAC 
GCGCCGCAGG TGGCGTACCT GCTGGTGGAA GCGCAGGCAT CGGCGGTCGC CGAAAAAGCG
CCGCTCAATT TCTGCCTCGT GCTCGACCGA TCAGGATCGA TGCAGGGCGC AAAACTTGCT
GCGCTGAAGG AGGCGACCAG GCGAGTGATT GACACCCTGA CACCTCAGGA TATTGTGTCG
ATTGTGCTCT TCGATGACAC CGTGCAGACG CTTGTGCCCG CGACGTTCGC CACCGACCGT
GATGCGCTTA AGGCGCAGGT TGACGCCATT GAGGAAGCCG GCGGCACGGC TATGTCGGGC
GGAATGGCGG CCGGCATTGT GGAATTGCGC AAGCACCATG ACCCTGGACG GGTTAGCGCC
ATGCTGCTGT TGACCGACGG GCAGACTTGG GGCGATGAGG ATCGCTGTCG CGCGCTGGCG
CAGGAACTTG CCCGCGACCA TGTGCGCGTC ACGGCGCTTG GTCTTGGCGC GGAATGGAAC
GAGAAGTTGC TCGACGACAT CGCCGATGCA ACCGGCGGAC TGTCGGATTA CATCGCCGAT
CCGTCCCAGA TCACGACGTT CTTCCAGCAT GCGGTGCGTA TGGCGCAGGG CACGATCGCA
CAGGATGCGC GCCTGCTCCT GCGGTTGGTG CGCGGTGCGA CGCCGCGCGC CGTATATCGC
GCCAATCCGA TCATTGCCAA CCTTGGCTAT CAACCGATTG GCGACAGTGA GATTGCTGTG
CGTCTGGGCG CTATCGAGAC GGATGCTCCA TCGAGTGTGA TCGTTGACAT GATGGTTCCA
GCGCGCGAAG CCGGAGTTTT CCGAGTCGCT CAGGCTGAGC TGCACTACAC GCCGGTTGGC
GGTTCGGAAC AGGTGATCAA ACAGGATATT CTGCTCGAAT TTGTCGCCGA ACCGGCGAAA
GCGGCATATG ATTCGCGTGT GATGAATCTG GTCGAAAAAG TAACGGCGTT CAAATTGCAG
ACACGCGCGC TTGCCGAAGC GGAGGCAGGC AATGTATCGG GCGCCACCCA GAAACTGCGC
GCCGCAGCGA CGCGCCTGCT CGATCTGGGA GAACTCGAAC TGGCGCAAAA GGTCGCCGAG
CAGGCTGAAC AACTCGATCA GGGGCAGGCG ATGAGCGCCG AACGCCAGAA AGAACTGCGC
TACGCGACCC GTCGCCTGAC GCAAAAACTC GAGGAGTAG
 
Protein sequence
METGVTLTCT WGRAPLVASD APQVAYLLVE AQASAVAEKA PLNFCLVLDR SGSMQGAKLA 
ALKEATRRVI DTLTPQDIVS IVLFDDTVQT LVPATFATDR DALKAQVDAI EEAGGTAMSG
GMAAGIVELR KHHDPGRVSA MLLLTDGQTW GDEDRCRALA QELARDHVRV TALGLGAEWN
EKLLDDIADA TGGLSDYIAD PSQITTFFQH AVRMAQGTIA QDARLLLRLV RGATPRAVYR
ANPIIANLGY QPIGDSEIAV RLGAIETDAP SSVIVDMMVP AREAGVFRVA QAELHYTPVG
GSEQVIKQDI LLEFVAEPAK AAYDSRVMNL VEKVTAFKLQ TRALAEAEAG NVSGATQKLR
AAATRLLDLG ELELAQKVAE QAEQLDQGQA MSAERQKELR YATRRLTQKL EE