Gene RoseRS_1396 details


Gene Information

Locus tag: RoseRS_1396
Symbol:
ID: 5208348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1704133
End bp: 1705647
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 62%
IMG OID: 640595007
Product: von Willebrand factor, type A
Protein accession: YP_001275746
Protein GI: 148655541
COG category:
COG ID:
TIGRFAM ID:
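
The coordinate and length fields in this record are consistent with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1704133, 1705647)
print(length_bp, protein_length(length_bp))  # prints "1515 504"
```

Both results match the reported Gene Length (1515 bp) and Protein Length (504 aa).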


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000309187
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000141462
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTATCATC AACCTCGATG GGCGATCATG CCCCTGCTCT GCGGGTTTAT CCTTCTTGCG 
GCGTGCGGCG GCGCTCCGCC GCCTGCAAGC CAGGTGGATG TGCAGGCGAC CATTGATGCT
GGCGTGCGCG CAACGCTGGC GGCGCAGCCG ACGGAGGCGC CATCACCTAC TACCCCTCCG
CCATCGCCAA CCGCTGTTCC GCCAACGGCG ACAACCGCGC CCACCAGTCC ACCGGTGGCG
CCGACGGTCG CGCCCTTTAC GCCCGCCACG CCGACCGCGA CCGGTGCAGA CGCGCCAACA
ACTGTACCTG AGTCCAGTTC CCCGCCGACT GACACGACCA CCATCTTTCG CCCTGCAGAA
GGCGAGGCGG CGCAGGTGAC CACCAACATT CAGCTCGTCT TCGACGCGAG CGGTTCGATG
GCGCAGCGGA TCGGCGGCGA GACCAAAATC CAGGCTGCGC GCCGCGCCAT GGAACGGATC
ATCGACACGC TGCCCGACAA CCCCGATCTG AACGTCGGCT TCCGGGTGTT CGGGCACGAA
GGCGACAGCA GCGAAGCGCA AAAAGCGCGT TCATGCCAGA GCACTGCGCT GCTGGTGCCG
ATGCAGGGAG TCAATAAAGC GTTGCTGCGG CAACAGGCTC AGGCATGGCA ACCGACCGGA
TGGACGCCGA TCAGTCTGGC GTTGCAGAGA GCAGGGGAGG ATTTCCAGGC GGGAGAGAAT
GTGCGTAACG TCATCATTAT GGTGACCGAT GGCGAAGAGA CGTGCGGCGG CGACCCGTGC
GCAGTTGCGA AGGCGCTCGC CGAGTCGCAG GCGGAAGTGC GCATCGACGT GGTTGGGTTC
GGGACGACGC CGGACGTGGC AAAAACCCTG CGGTGCATTG CCGAGAACAG CGGCGGCGTC
TATACTGATG CGCAAAATGG TGATGCGCTG GTGCAGACTC TGGAGGAACT GATCGCCGCT
ACCCTCAAAC GGAGCACTCT GCGCTTCATC CCTGTGAGCA TAAGCGGCGC ACCGGAAGAG
GTATCGCTGA CCCGCCTGGT CAATGCCCGG GGGGAAGACG TTATGAAAAC CGTCCAGCTG
CCATGGATGG CGCGATTTGC CCGCGAGCAG GTGGTGGAAC TTCCACCAGG CGAGTATCGT
TTCACCATCT CCTACAGTGA GATATTCGTC GATCAGACGT CGAAGCATCT TGAGACCACA
TACACCGCAA TCATCGAAGA GGCGCGCGAA ACCGTGGCGG TCATCGGGCG CGGACAGGTA
ACCTTTATCA ACGATTCACC CCAATTGCTC CGGCCGGGCG ACGTGCGGGT TGAAAAGGCA
GTTGATGGGC AGTGGGAAGA GTCCATCAGT CCCGGACAAC TCGTCTCTCT TGGCCCGTAT
TTTGAGTTTG AACGACCGTT TCGCCTCACG CCGGGACGCT ACCGGGTCTA TGACCGCACA
CGGGGGAAGG TGTTGATCGA TAACCTGATT GTCGTGCCCG GCAAAGAAAT CACGGTCAGG
CTCAGTGGCG GGTAG
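
The GC content field is conventionally the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted in the blocked form shown here (groups of ten bases separated by spaces and line breaks). The helper name `gc_content` is illustrative, and the example runs only on the final line of the sequence, not on the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Last line of the gene sequence only (15 bases, 10 of them G or C).
last_line = "CTCAGTGGCG GGTAG"
print(f"{gc_content(last_line):.1%}")  # prints "66.7%"
```

Passing the full gene sequence as a single string gives the whole-gene value to compare against the GC content field.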
 
Protein sequence
MYHQPRWAIM PLLCGFILLA ACGGAPPPAS QVDVQATIDA GVRATLAAQP TEAPSPTTPP 
PSPTAVPPTA TTAPTSPPVA PTVAPFTPAT PTATGADAPT TVPESSSPPT DTTTIFRPAE
GEAAQVTTNI QLVFDASGSM AQRIGGETKI QAARRAMERI IDTLPDNPDL NVGFRVFGHE
GDSSEAQKAR SCQSTALLVP MQGVNKALLR QQAQAWQPTG WTPISLALQR AGEDFQAGEN
VRNVIIMVTD GEETCGGDPC AVAKALAESQ AEVRIDVVGF GTTPDVAKTL RCIAENSGGV
YTDAQNGDAL VQTLEELIAA TLKRSTLRFI PVSISGAPEE VSLTRLVNAR GEDVMKTVQL
PWMARFAREQ VVELPPGEYR FTISYSEIFV DQTSKHLETT YTAIIEEARE TVAVIGRGQV
TFINDSPQLL RPGDVRVEKA VDGQWEESIS PGQLVSLGPY FEFERPFRLT PGRYRVYDRT
RGKVLIDNLI VVPGKEITVR LSGG