Gene Rcas_2888 details


Gene Information

Locus tag: Rcas_2888
Symbol:
ID: 5540377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3745802
End bp: 3747082
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 61%
IMG OID: 640895008
Product: hypothetical protein
Protein accession: YP_001432968
Protein GI: 156742839
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
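
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon contributing no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3745802
end_bp = 3747082

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be the
# stop codon, which does not add a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1281 0 426
```

Under these assumptions the result agrees with the listed gene length of 1281 bp and protein length of 426 aa.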


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGCTC TGCGTCCTAT CGCGCTGGTT GCGCTTGCAC TGCTGCTCTT CGTGGCGGCG 
CACGGCACGG GGATCGATCT GTTTTTCCAA CTGAGCTACC TGTTAGTTGG CATTGTGATT
GTAGCGTACC TGTGGGCATG GCTCAATCTG CGTGGTTTGA GCGTGCGGCG CGAGGTTTTC
ACCCATCGCG CACAGGTCGG AGATGTGGTG CGCGAGCGGA TGACGCTGGT CAACCACTGG
TTCCTGCCCA AACTCTGGAT CGAGGTGATC GACCGTTCAA ACCTGCCCGA CCACAGTGCC
GGTTTCGTCG CGTATCTACC AGGATACGAT CAGCGGCGCC AGGTTATTCG CACCACCTGC
ACCATGCGCG GGAAGTTCCG GTTGGGTCCG GTGACGCTGG TAAGTAGTGA CCTGCTGGGT
CTGTTTCGTT TCCAGCGCGA TATTCCAGGC GATAACGAGA TTCTGGTCTA CCCGCGCACC
GTTCCGCTGC CAGGTTTCGT GCTGCCCGGC GCAGAATTGC CGGGTGGCCA GGACCTCCGG
CGACGCACGT ACCACGTAAC ACCAAATGTC GCCGCCATTC GCGACTATCA ACCCGGCGAC
GGTTTCAACC GCATTCACTG GCGGAGCACA GCGCGCCTGG GCAGGTTGAT GGTGAAGGAG
TTTGAACTCG ATCCGACTGC CGAGGTTTAT GTGGCGCTCG ATATGCATGA ATATGTGCAG
CAGGCATGGC GGCCCGTAGA AAGAACTTCG GGCAGGCAGT TCCGGCGAAC CACCGAGTCG
ACCGAGGAAT ATGCGGTGCA TGCCGCAGCA TCGATTGCGC GTCATGTGCT CGAGCAGAAT
CGTGCCGTGG GGTTGATCGC CTGGGGACAG CGCCGCGAAG TCATTCCGCC TGAGCGCGAG
GCGCGGCAGT TGTACAAAAT CCTGGAGGCG CTGGCGGAAC TGCGCGCCTA TGGGTCTGCG
TCGCTGGCGG AAGTGTTGAG CGCCGAAAAC GCACGCTTCG GGCGCAACTG CACGCTGGTG
GTGATTACTC CGTCGCTGGA TGAGCGGTGG GTTACAGGAG TCCAGCACTT GCGGTATCGA
GGGGTGCGTA TCGTTGCGAT TCTGATCGAT GCGGAGTCGT TCGGCGGTGG GCGCAGCAAC
GAGTCGATCC GTGGACGCCT GGCAGAACTG CGCGTGCCAA CCTGTGTCTG GCAACGCGGA
CAACCGCTGA CGACGGCGCT TGCTCAGACC GCCGCAATGG GTCTTCACCA TGCGGGAGCG
CCGCACGCTC GCCCATCGTG A
 
Protein sequence
MHALRPIALV ALALLLFVAA HGTGIDLFFQ LSYLLVGIVI VAYLWAWLNL RGLSVRREVF 
THRAQVGDVV RERMTLVNHW FLPKLWIEVI DRSNLPDHSA GFVAYLPGYD QRRQVIRTTC
TMRGKFRLGP VTLVSSDLLG LFRFQRDIPG DNEILVYPRT VPLPGFVLPG AELPGGQDLR
RRTYHVTPNV AAIRDYQPGD GFNRIHWRST ARLGRLMVKE FELDPTAEVY VALDMHEYVQ
QAWRPVERTS GRQFRRTTES TEEYAVHAAA SIARHVLEQN RAVGLIAWGQ RREVIPPERE
ARQLYKILEA LAELRAYGSA SLAEVLSAEN ARFGRNCTLV VITPSLDERW VTGVQHLRYR
GVRIVAILID AESFGGGRSN ESIRGRLAEL RVPTCVWQRG QPLTTALAQT AAMGLHHAGA
PHARPS
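
The protein sequence can be reproduced from the gene sequence with translation table 11, and the GC content can be recomputed the same way. The following is a minimal sketch, not the database's own pipeline. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard NCBI amino-acid ordering with bases in the order T, C, A, G, which table 11 shares with the standard code for non-start codons. Alternative start codons are not handled specially, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies
# fastest), as used by NCBI translation table 11; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
first_groups = "ATGCACGCTC TGCGTCCTAT CGCGCTGGTT"
print(translate(first_groups))  # MHALRPIALV
```

Passing the full gene sequence to `translate` should give the full protein sequence, and `gc_fraction` on the same input can be compared against the reported GC content.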