Gene Rcas_1710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1710 
Symbol 
ID5539188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2206110 
End bp2208206 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content61% 
IMG OID640893849 
Productvon Willebrand factor type A 
Protein accessionYP_001431820 
Protein GI156741691 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.425591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG TGACCGAACG CCTTGTGTTT CCGATCCGAA TGCCGGCAGT GCGGATGCTG 
TGGGTTGCAA TGCTCATTGC AATCTTGCTC CTGCCGACAT CTCCGGCGAC GGCGCAGCAG
ACCGGGCAGG CGCTCGATTC AGGAAATAGC GACGTGGTCC TGATTATCGA CAACTCCGGC
AGCATGAAGC AAAACGATCC GCAGAACCTG CGTCTCGCCG CAGCCAATCT CTTTATCGAT
CTATCGGACC CGCGCGACAA AATCGGCATT GTTGTTCTTT CCGACCGTAT GCGCACGCGT
TCGCTGACGA AGAATCTTGT CCGTATCGGC AGCCGGCAGG ACATCGATGA GTTGAAGGGA
CTGGTCGATG CGCTGCGCAA CGAGACAAAA GGGCAGGAGA CGCATATGGG AACGGCGCTC
GATCTGGCGT ATGACCTGCT CGACGCAACG CCGGGATCGA ACCGGGGCGC CAACCAGCGT
CAGTTTGTCG TGCTCCTCAG CGACGGATTG CCAACGGGCG TCGGGCAACG CGAGCGCGTC
GATCAAGCCG TACAGCGCTT CCGTGAGCGA CGGTACTGGA AGATATTCTC CATTGCGCTG
GGCGATGAAG CGGATCCGGC GTATCTCGAC GAGAAGGTGT CCTCTCCCTC CGGCGGGCAG
GTGGTCGTCG CCCGCCATGC CGGTGAACTG CTCGACCGGT ACCTTGACGT GTATGCGCGC
GCCGGTGATG ACCGGTACAT CAATTATGTC ACAGTGCAGC CGAACACGCT GGCGCCGCTG
GTTGACGTGC GGCTGGATCA CCAGCCGACG CAGATCGGCG TCGTGCTGGT GCGTGGCGAC
AGCAATGCCA GCATTAGCAG TCTATTGGCG CCAGACGGGG CGGATCTGGT GCAACCCTAC
TACCAGAATA GCGTGCGGCG CGGGGCTGAA CCGGAATACG AACTGTATAC GGCAATGTCA
ACCGACCAGG TATCGCTCGT TGGGCGCTGG ATGATCAATG TTGATCGTCC TGATGCGTTG
CCAACCACAA TTGCCGTGCT GAGTCGCTCG CGGCTACGCA TACGGATGCC AGCGCCGGCG
CCGCTGCGTG ACAACGAGGA TACCAGCCTG CGCTACCATC CGGTCGGACG ACCGCTACTG
TTGGTGGTTG GGGCGCAGGT TGCCGAACGA AATTACGATC AGCATGTTAC TACTCCCTAC
CTGTACCGCT GGGTGGCAGA CATGGCGCCG GCAGCGCACA TGCTCACGCC GTTCGAGGGT
CCACCCATTG TGCTGGTAGA CGATGGACGC GCCTGTGATC AGCGCGCAAA CGATGGACGT
TACAGCGGCG TACTGCCGCC TTTCCCCACT GAAGGGGATT ATACGTTGCG TCTGGAATTC
CCTGGGGCGC ACCCGAACCC TATCCATGTG CAGAAAGACT ACATCGTGCG CGTGGCTGCG
TTGCCGACCA TGACGATAAC GCTGCCGCCG GCTGCGACAA CCCTGCCGAT CAATACGCCA
TTGACCGCCT GGATCGATCT GCCCGGAAGG GCAGACTTCG AGATTGTGAA TGTGATGTTC
CCAACAGCGT TTGTGCAGCG TCCCGACGGG GTGCTCGAAA CGCTGGAGAT TGAGTCGGTG
GACCGCGGAC GTTTCCGTTT CCGCTACACA CCCGGCTTCG AGGGACAGTA CCGCATCAAT
ATTGCGGCAG AGGTGCATGG GCGAGGTGCA ATGGGGGACA TCCGGTACAT CGACTATGCC
GACGCCCTGA TCGGCGTACC GAAGGCGACG CCGATCGTTG AGATCAGCGC TGCCTTCACC
GGCACGCTGG TCTACGACCG GCGAGGCATT TTGAGCGTTC CTCTCAAGAT TGCATCACGC
TCTCCACAGG AAGAGCGCCT GGTGATCACA GTGACCAACC CGGCAGGCGC GATCACGGTG
CCGGCAGAGG TGCTTTTGCA GCCAAACGAG TCCATACAGC GCACAATCAG CGTGCGACTG
CCGGAGAAGG ATCGTCCCGC GCGCGGCGCC CTGATGCTCC AGTTGACGGC GCCAGAGCAG
CGCGTGATTG TTCAGGGTGA GACCATCAGC GTCGCTATCG TGCGCCTGCC GGTCTGA
 
Protein sequence
MTIVTERLVF PIRMPAVRML WVAMLIAILL LPTSPATAQQ TGQALDSGNS DVVLIIDNSG 
SMKQNDPQNL RLAAANLFID LSDPRDKIGI VVLSDRMRTR SLTKNLVRIG SRQDIDELKG
LVDALRNETK GQETHMGTAL DLAYDLLDAT PGSNRGANQR QFVVLLSDGL PTGVGQRERV
DQAVQRFRER RYWKIFSIAL GDEADPAYLD EKVSSPSGGQ VVVARHAGEL LDRYLDVYAR
AGDDRYINYV TVQPNTLAPL VDVRLDHQPT QIGVVLVRGD SNASISSLLA PDGADLVQPY
YQNSVRRGAE PEYELYTAMS TDQVSLVGRW MINVDRPDAL PTTIAVLSRS RLRIRMPAPA
PLRDNEDTSL RYHPVGRPLL LVVGAQVAER NYDQHVTTPY LYRWVADMAP AAHMLTPFEG
PPIVLVDDGR ACDQRANDGR YSGVLPPFPT EGDYTLRLEF PGAHPNPIHV QKDYIVRVAA
LPTMTITLPP AATTLPINTP LTAWIDLPGR ADFEIVNVMF PTAFVQRPDG VLETLEIESV
DRGRFRFRYT PGFEGQYRIN IAAEVHGRGA MGDIRYIDYA DALIGVPKAT PIVEISAAFT
GTLVYDRRGI LSVPLKIASR SPQEERLVIT VTNPAGAITV PAEVLLQPNE SIQRTISVRL
PEKDRPARGA LMLQLTAPEQ RVIVQGETIS VAIVRLPV