Gene Rcas_3898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3898 
Symbol 
ID5541404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5101620 
End bp5103023 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content58% 
IMG OID640896009 
Producthypothetical protein 
Protein accessionYP_001433952 
Protein GI156743823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000694886 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCCGG TCTGGGAAGG TTTCTTCATC CACTCGGCAA TGTTCGTTGC CATCGTGTCC 
GCTTTCCACG TGCTCGCTTC GCACCTGACG GTCGCAGCCG CGTGGTTCAA CCTGTACCTG
GAGCGGCGCG CAGTATACGA GAATCGCCCG GAACTATACG TCTATCTCAA GCGAAGCGCC
CTGGGATTGC TGGTGTTCGC GTATGTCTTC GGAGCGATGG CCGGAGTGGG CATCTGGCAA
ACAACCACCG CCGCTAACCC GCGCGGCATT TCTACCCTCA TCCACAACTT CGTGCTCTAC
TGGGGATCAG AGTGGTACAT GTTCCTGATC GACGTGGTAG GAATCATTGC GTATTACTAC
ACGTTCGAGC GCGTCAGCCC GAAGACGCAC CTGCGTCTGG CATGGATCCT GGCATTGGGA
GGCACCGGTA CACTGACAAT CATCGTTGGC ATTCTGTCGT TCAAATTGAC GCCGGGCCTC
TGGTTCGAAA CGGGGGCGAG TCTGAACGGA TTCTTCAATC CCACGTTCTG GCCCCAACTC
TTCATGCGAT TTGCCCTGAT GTTTACCATC ACGGCAGCCT GGGCGCTTTT GATTGTCACC
GGACTGCCGA ACGGGTACTT TGCGCGTGAG CGCATTATTC GCATTGCGGC AGTCATGGGA
CTGGGCGGGT TGATCGTTGC GCTGGGCATC TGGTTCTTCT GGTACGACCC CACTCTGCCG
GCTCACGCGA AGACCATTCT GCGCTCGCCT GCCATTCCAC CGATCACCTT CACGGTCATC
ATCGGCGGTC TGATTGCGAC ATTCCTGGGG CTTCTGTTTG CGCTGGTCAT GCCGCGCCGT
CAGCATCAGA TTATCGCCCT GGGCGCAATG CTGGTGTTGT TTGCAGCCAT CTTCGGCGCC
GAACGCACTC GTGAGGTCAT TCGCAAGCCC GACATTATTG CCGGCTATAT GTCGTCGAAT
CAACTGGTAT TCAACGATCT TCCCGCTCGC GGCATTCAGC GCGAAGAGCA ACCGTTGAAC
GAAACCGGCA TGCTCGGCGC ACTGCCGTTT CTGCCGCGCC CGGATCAGAT TTCAGTCGCA
GCAACGGGCG CTTCCAGCCA TCAGGTTGCT ATGGGACGGG TGCTGGTCAT TCAACAGTGC
GCGGCTTGCC ACAATGTCAG CAACCAGACG GCGATCACCG TCTTCGATCA GCGGCTGGCG
TTGCGTTCGC TGGCGCAGTT GCTCGAACGA CGCAAAATGA CGACCGCGCC AAAAATTGAG
ACCTACCTGA ACGGCATTGG GGCGTTCCCA TATATGCATC CCGTCGTCGG CACGCCCGAA
GAGCGGGCGG CGATGGCACT ATACCTGGAG TATTTCTTGC AACAACAGCA CGCACCACAG
TCACAGGCGC AAGCCAGGAG GTGA
 
Protein sequence
MYPVWEGFFI HSAMFVAIVS AFHVLASHLT VAAAWFNLYL ERRAVYENRP ELYVYLKRSA 
LGLLVFAYVF GAMAGVGIWQ TTTAANPRGI STLIHNFVLY WGSEWYMFLI DVVGIIAYYY
TFERVSPKTH LRLAWILALG GTGTLTIIVG ILSFKLTPGL WFETGASLNG FFNPTFWPQL
FMRFALMFTI TAAWALLIVT GLPNGYFARE RIIRIAAVMG LGGLIVALGI WFFWYDPTLP
AHAKTILRSP AIPPITFTVI IGGLIATFLG LLFALVMPRR QHQIIALGAM LVLFAAIFGA
ERTREVIRKP DIIAGYMSSN QLVFNDLPAR GIQREEQPLN ETGMLGALPF LPRPDQISVA
ATGASSHQVA MGRVLVIQQC AACHNVSNQT AITVFDQRLA LRSLAQLLER RKMTTAPKIE
TYLNGIGAFP YMHPVVGTPE ERAAMALYLE YFLQQQHAPQ SQAQARR