Gene Rcas_3769 details


Gene Information

Locus tag: Rcas_3769
Symbol:
ID: 5541271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4941135
End bp: 4942457
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 59%
IMG OID: 640895879
Product: hypothetical protein
Protein accession: YP_001433826
Protein GI: 156743697
COG category: [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
    [COG1611] Predicted Rossmann fold nucleotide-binding protein
TIGRFAM ID: [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
    [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2
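
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4941135, 4942457)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch reproduces the listed gene length of 1323 bp and protein length of 440 aa.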


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.96163
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0729973
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTCGATG AACGAGTCGT GCTCATCACC GGTGGCGGTT CAGGAATCGG GCGCGCAACA 
GCGCTGGCGT TCGGCAGGCA CGGCGCGCGC GTAGTCGTCG GCAATCGGGA TCGTGACGCC
GGTGAAGCGA CGGTCGCCGC AATCCGGGCA ATGGGTGGAA CGGCGTTGTT CGTTCCGACC
GATGTCACCC GACCAACAGC GGTGCGCGCG CTGATCGACG CAGCGGTTGA AACGTTTGGA
CGCCTCGATG TTGCATTCAA CAATGCTGGA TGGTTCGGAT CGGTCGCGCC GCTGGCAGAA
CAGGACGAGC ATGAGTTCGA CCCTGTTTTT GACACAAATG TACGCGGCGT TTTTCTGTGC
ATGAAGTACG AACTGGCACA GATGCTGAAA CAGGGGCAGG GTGTCATCAT TAACAATGCA
TCGACGACCG GCATACGCAA TTCGACAATG GGTGTGGCGC TGTATGCCGC AGCAAAAGCG
GCGGTGATTT CCCTGACACG CTCTGCCGCC ATAGAATACG CCGCTCATGG CGTGCGCATT
AATTGCAGTA GCGCCAGGAC GGATTGCGAC CGACATGCTG GCGAAAGCAG GCGGCGGCAA
CCCGGAACGT TTTGCTGCGG TGATTCCCAT GCGACGCCTT GGTACATCAG AAGAAGTCGC
TCAGGCAGTT CTCTGGCTGG CATCATCGGC GGCGTCATTC GTCACCGGTC AGGTGCTGGG
GGTCGATGGC GGATACCTGG CATCATAATA CACATAGAAG GAGACACAAC GATGCATGTC
TGCGTCTACT GCGCCTCGAG CGATCACGTA CCGGCACTCT ATCTGGAAGC GGCCCATACC
TTCGGTGAAG GGATGGCGCG GCGCGGCTGG ACGCTGGTGT ACGGCGGGGG TGGCATCGGA
TTGATGGGCG CGGTCGCGCG AGCGGTACAC GGCGCCGGCG GACGGGTCAT TGGCGTTATT
CCACAAACGC TGCTGGAACG CGAAGTCGGC TATCAGGAAG CGGACGAACT GATCGTCACC
GGAACTCTAC GCGAACGCAA GCAGATCATG GATGATCGCG CCGATGCGTT CGTCGCGCTG
CCAGGCGGCT TCGGCACACT CGAAGAATTG CTGGAAATCA TGACGCTTCG CATGCTTGGC
TACCACAACA AGCCGATCGT CATCGTCAAC ATTGGCGGTT ATTTCGATCC GCTCCTGACA
CAGTTCGAGT ATATCTTCAC CCAAAACTTT GCCCACGAAC GCTACCGCCG TCTCTACGCC
GTGAAAGCCG ATCCTGAAAC AGCCCTCACC TATCTGGAGC GAATCACGCC TGGCTCGCCG
TAA
 
Protein sequence
MFDERVVLIT GGGSGIGRAT ALAFGRHGAR VVVGNRDRDA GEATVAAIRA MGGTALFVPT 
DVTRPTAVRA LIDAAVETFG RLDVAFNNAG WFGSVAPLAE QDEHEFDPVF DTNVRGVFLC
MKYELAQMLK QGQGVIINNA STTGIRNSTM GVALYAAAKA AVISLTRSAA IEYAAHGVRI
NCSSARTDCD RHAGESRRRQ PGTFCCGDSH ATPWYIRRSR SGSSLAGIIG GVIRHRSGAG
GRWRIPGIII HIEGDTTMHV CVYCASSDHV PALYLEAAHT FGEGMARRGW TLVYGGGGIG
LMGAVARAVH GAGGRVIGVI PQTLLEREVG YQEADELIVT GTLRERKQIM DDRADAFVAL
PGGFGTLEEL LEIMTLRMLG YHNKPIVIVN IGGYFDPLLT QFEYIFTQNF AHERYRRLYA
VKADPETALT YLERITPGSP