Gene RoseRS_3349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3349 
Symbol 
ID5210326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4199767 
End bp4201056 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content62% 
IMG OID640596947 
Productvon Willebrand factor, type A 
Protein accessionYP_001277660 
Protein GI148657455 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACAG AACGAACACC GGAAACCGAT CGGTTGTATG AACAGGGTAT GGCGTGGATG 
CGCGAAGCGC GCTGGGAAGA GGCAATCGCC GCTCTATCGC AGGTGCGCGC GCTGTCGCGC
GCCTACCCGG ATGTTGATGC GCTTATCGCC GATGCGCAGT TGAAACTGGA GATCGAGCAG
GTGGGCGTAC CTCCAGCATT GGCGCCGCCA CGTCCACATC CGCCGCGTGC AGCGCTGATC
GGCGGGTTGA CCGCGCTGAC ATTGATCGTT GCGGCGGTCA TTATCGCGCT GATCGGTCGA
ATCGACCCAT CGAGCGTCGG CAGTGCGGCG CAACCGATCA TGCTCAGCAT CGGCTTTCCA
ACCATTGCCC CTACCAGAAC GCCGACCCCT GCGCCGACCG CCACGCCAAT CCCGGTGCGC
CCAACTGCAA CGCCAGCGGC ATCTGTGGTC CTGCCTGGAA CGCTGGGAGT GCGCATGGCG
TCGGGAGAGC GATTACCCGG CGCCACCCGA AACCTGGCAA TCATTCTGGA CGCTTCCGGC
AGTATGCTGG CGCGCATCGA TGGTGCGCCC AAGACGGTGA TCGCTCGCCA GGCGCTGATT
GCGCTGGTTG AACGTCTGCC GGCAACCACG AATGTTGCAT TGCGCACCTA CGGTCATCGG
CGCGCCGACG ATTGCAGCGA TACCGAACTC GTTCAGGCGC CTGCCCCCAT CCAGCGCGCC
GATCTGATCA ACCGCATCAA CGCTATTCGA CCGGTCAACG GTGGACGCAC TCCTATAGCG
CAGTCGCTGG AAGATATGGC GCGAGACCTG GCTGGCGTCG ATGGCGAGGT GCTGATCGTG
CTGGTCAGCG ACGGTGATGA AACCTGTGGC GGCGACCCGG TTGCAACGGC GGCAGCGCTG
CACACCGCCA ATCCCCGTTT GCGGGTGAGT GTGATTGGGT TCAATATCGA ACAGGAAGAG
TGGCGCCGGC GCCTGGAAGG AATAGCCGCG TATGGCGGAG GGGCGTACTT CGATGCTGCG
AATGCCGTGC AACTCGCCGA TGCCCTCGAA CAGGCGGTTG CGCTGACTTA CCGTGTGATC
GACAGTCAGG GAAACCAGGT CTACCAGGGA CGGATCGGGA ACACGGTTAC CCTGCCACCC
GGCGCCTATC GTGTTGAAAT CAGCGGTGAT GCTGCGATAA CCTTTGAGAC AGTCTTTGTT
GAAAGTGGAC ACACCACATT TGTCGAACTG CGTGATGAAC AGGGGGCGTT GCGCGCCAGC
ATCATCGCGG GTGATGACGT AGCGCCGTGA
 
Protein sequence
MNTERTPETD RLYEQGMAWM REARWEEAIA ALSQVRALSR AYPDVDALIA DAQLKLEIEQ 
VGVPPALAPP RPHPPRAALI GGLTALTLIV AAVIIALIGR IDPSSVGSAA QPIMLSIGFP
TIAPTRTPTP APTATPIPVR PTATPAASVV LPGTLGVRMA SGERLPGATR NLAIILDASG
SMLARIDGAP KTVIARQALI ALVERLPATT NVALRTYGHR RADDCSDTEL VQAPAPIQRA
DLINRINAIR PVNGGRTPIA QSLEDMARDL AGVDGEVLIV LVSDGDETCG GDPVATAAAL
HTANPRLRVS VIGFNIEQEE WRRRLEGIAA YGGGAYFDAA NAVQLADALE QAVALTYRVI
DSQGNQVYQG RIGNTVTLPP GAYRVEISGD AAITFETVFV ESGHTTFVEL RDEQGALRAS
IIAGDDVAP