Gene Rcas_1363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1363 
Symbol 
ID5538835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1742779 
End bp1745697 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content64% 
IMG OID640893500 
Productvon Willebrand factor type A 
Protein accessionYP_001431477 
Protein GI156741348 
COG category[S] Function unknown 
COG ID[COG5426] Uncharacterized membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0717995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCGCC TCTCGTTCAT TACGCCTCTC GCCCTCACGC TCCTGGCGCT GATCCCGGCG 
CTGTGGGCTT TGACGCTGCT GACGCCGCGC CGCCTGGCCC CCTGGCGCTT CTGGTCGAGC
CTGGTTCTGC GCAGCATCAT CCTTCTTGCG CTGACGCTCG CCCTCGCCGG GACGCAGATC
GTCCTGCCGG TGCGCGAACT GACGACGGTG TTCCTGGTCG ATGTGTCGGA CTCGATGACG
CCGGCGCAAC GCGAACGCGC CTTGCAGTAC GTTAACGACG CACTGGCTGC CATGCCTCCA
GGCGACCAGG CGGCTGTCGT GGTGTTCGGC GACAATGCGC TGGTGGAGCG CGCTCCTGGT
CCAATCGGTC CGCTGAGTCG CCTGACGTCG GTTCCGATCA CGACACGCAC CAATCTCCAG
GAGGCAGTAC AGTTGGGGCT GGCGCTCTTC CCGGCGGAAA CGCAGAAGCG CCTGGTGCTC
ATTTCGGACG GCGGCGAGAA TGCCGGGCGT GTGGCGGATG CAGCGCAACT CGCGGCTATT
CGGAAGGTGC CAATCGATGT CGTCTACATG CCGGGCGAGC GAGGTCCTGA TGTCATCGTT
GCCGGGCTGA GCGCGCCAGC CGTCGTGCGT GAAGGGCAGG ACCTCACGTT GCAGGCGAAT
ATCACGTCCA ACTATGCGAC GAGCGGACGT TTGCAAACGT TTGTGGACGG GCAACTGATC
GGTGAGCAGG AACTCTCCAT CCCTGAAGGA GCGAGCACCA TCGATATTCG CGTCCCTTCG
GGCGAAACCG GGTTTCGCCG CATCGAAGTG CGCCTCGACG CCGATGGGGA CACAGAGCCG
CAGAACAATC GTGGGGCGGC GTTCACCGAA GTGTTGGGAC CGCCGCGCCT GCTGTTGATC
GCCTCCAACG AGGCGCGCGC CGTCAATCTG CGCGACGCGC TGCGCGCCGC CGAGGTGCGT
GTCGATGTCC TCCCGCCGGA TCAGGCGCCC GCCACTCTGG ATCAGCTCGG CGCCTACGCT
GGGGTGATAA TTGTCGATAC GCCAGCGCGC GATATGCCTC GCACATTGAT GGAGGCGCTG
CCGGTTTATG TGCGTGAACT GGGGCGTGGG CTTGCCATGG TCGGCGGCAT CGATTCATTT
GGCGCCGGGG GGTATCGGCG CACGCCGCTG GAGCCAGTGC TACCGGTGTT GCTCGATCCG
CTCGACACAA AGCAGCAACC GGACCTGGCA CTGGTGATGG TGATCGACCG CAGCGGCAGC
ATGTCGGAGT TGGTGGGCGG AAGCCGACGC AACCGGCTCG ACCTCGCCAA GGAAGCGGTT
TATCAGGCAA GCCTTGGTCT GACCCCGATC GATCAGGTCG GGCTGGTTGT GTTCGACGAT
GCGGCGAATT GGGTGCTGCC GCTGCAACGC TTGCCTTCGG TCGTCGAAAT CGAACGGGCG
CTCGGTTCGT TTGGCATCGG CGGCGGCACG AATATTCGAC CGGGCATCGA ACAGGCAGCA
CAGGCGCTGG CTTCCGCCGA TGCAAAGGTC AAGCACGTCA TTCTGTTGAC CGATGGCATC
GCAGAAAGCA ACTACAGCGA TCTGATCGCG CAGATGCGCG CCGCCGGCGT CACCATTTCC
ACGGTTGCAA TCGGTGAAGA CGCTAATCCC AATCTGGTCG ATGTGGCGAA TGCCGGCGGC
GGTCGTTCCT ATCGTGTGAC CAGGATCGAG GACGTGCCGC GCATTTTTTT GCAGGAGACA
ATCATCGCCG CCGGGCGCGA TATTGTCGAG GAGCGGATTG AACCGCAGGC GGGTCTTCCC
TCGCCGATCA TCCGCAGTCT GGGAGGGTTG CCGCCGCTCT ATGGCTACAA TGGCACCGAG
GTGCGCGAAG CGGCGCGCAC GTTTTTGTTC ACACCTGATG GGAAGCCTTT GCTGGCGCAA
TGGCAGTATG GTTTGGGGCG CGTTGTCGCC TGGACGAGCG ACGCACAGGG GCGCTGGGCG
CGCGACTGGA TTGCCTGGGA TCAGTTTCCG CGCTTTGCCG GCGGGATGAC CGATCTCTTG
CTTCCACCAC GCGAAAGCGG AACGCTCGAA CTCCGCGCGA CTGCCGCTGG TCCGCGCGCA
TTGATCGAGT TGACCGCTCA GGACGAGCAG GGACGTCCGC TTAACAATCT GGTTATTGCA
GGGCGCGCCG TCGATCCGCA GAATCAGGGA ACTGCGGTCC AATTTCAGCA GATCGGTCCG
GGCCAGTATC GCGCAGTCGT CGATACATCG TCGCCGGGCG TCTACCTGGC GCAGGTGGCG
GTTTCCGATG CGGAAGGACG ACAAATTGGC GTCGCGGTGA CCGGCATTGT GGTCAGTTAT
TCGCTGGAAT ACAGCGCGCA GCGCGAAAAT CTGCCACTGC TCAGCGACGT AGCAGGCATC
AGCAGCGGGC GGATCAATCC TCCACCTGAT GTGGTCTTTG CGTCACCCAA TCAAAATGTC
GGTTCAGTGC GAGAGATTGG ACTTCCGCTC CTCTGGCTGG CCCTTCTCCT CTGGCCCCTC
GATATTGCCG CGCGGCGTGT GATGGTACGG ATGGACGACG TGGCGCCGTG GCTTGAACGG
CTCCGTCGGC GGCGACCGTC GTCGGTGGCT GCGCCGGAGG CGGCGTCAAC CATGACACGA
CTCGGCGCTG CGAAACGGCG CGCGATGATT GCGCGCACAT CGCCGAATCG CGCCGCTGCC
AGCTCGGAGC AATCGGTTAC GCCGCCGGTG ATGACGCAGT CTCGCCAGAC GCCACCTCCT
GCGCCAGAGT CCCGCCCGTC TGCGCCAGCG TCGGGCGCAT CCGAGACGCG CGCCCGATCA
TCCGAAAAAC GTCCGGTCTC GCCGGAAGCG ACGGAAGAAC AGTTCGCCCG GTTGTTGGCG
GCGAAACAGC GCGCGCGGCG CAAATCGGAG GAGCGGTAA
 
Protein sequence
MIRLSFITPL ALTLLALIPA LWALTLLTPR RLAPWRFWSS LVLRSIILLA LTLALAGTQI 
VLPVRELTTV FLVDVSDSMT PAQRERALQY VNDALAAMPP GDQAAVVVFG DNALVERAPG
PIGPLSRLTS VPITTRTNLQ EAVQLGLALF PAETQKRLVL ISDGGENAGR VADAAQLAAI
RKVPIDVVYM PGERGPDVIV AGLSAPAVVR EGQDLTLQAN ITSNYATSGR LQTFVDGQLI
GEQELSIPEG ASTIDIRVPS GETGFRRIEV RLDADGDTEP QNNRGAAFTE VLGPPRLLLI
ASNEARAVNL RDALRAAEVR VDVLPPDQAP ATLDQLGAYA GVIIVDTPAR DMPRTLMEAL
PVYVRELGRG LAMVGGIDSF GAGGYRRTPL EPVLPVLLDP LDTKQQPDLA LVMVIDRSGS
MSELVGGSRR NRLDLAKEAV YQASLGLTPI DQVGLVVFDD AANWVLPLQR LPSVVEIERA
LGSFGIGGGT NIRPGIEQAA QALASADAKV KHVILLTDGI AESNYSDLIA QMRAAGVTIS
TVAIGEDANP NLVDVANAGG GRSYRVTRIE DVPRIFLQET IIAAGRDIVE ERIEPQAGLP
SPIIRSLGGL PPLYGYNGTE VREAARTFLF TPDGKPLLAQ WQYGLGRVVA WTSDAQGRWA
RDWIAWDQFP RFAGGMTDLL LPPRESGTLE LRATAAGPRA LIELTAQDEQ GRPLNNLVIA
GRAVDPQNQG TAVQFQQIGP GQYRAVVDTS SPGVYLAQVA VSDAEGRQIG VAVTGIVVSY
SLEYSAQREN LPLLSDVAGI SSGRINPPPD VVFASPNQNV GSVREIGLPL LWLALLLWPL
DIAARRVMVR MDDVAPWLER LRRRRPSSVA APEAASTMTR LGAAKRRAMI ARTSPNRAAA
SSEQSVTPPV MTQSRQTPPP APESRPSAPA SGASETRARS SEKRPVSPEA TEEQFARLLA
AKQRARRKSE ER