Gene Information |
Locus tag | Rcas_3769 |
Symbol | |
ID | 5541271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4941135 |
End bp | 4942457 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640895879 |
Product | hypothetical protein |
Protein accession | YP_001433826 |
Protein GI | 156743697 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases); [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1; [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
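The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(4941135, 4942457)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1323 bp) and Protein Length (440 aa) fields.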
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.96163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0729973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATG AACGAGTCGT GCTCATCACC GGTGGCGGTT CAGGAATCGG GCGCGCAACA GCGCTGGCGT TCGGCAGGCA CGGCGCGCGC GTAGTCGTCG GCAATCGGGA TCGTGACGCC GGTGAAGCGA CGGTCGCCGC AATCCGGGCA ATGGGTGGAA CGGCGTTGTT CGTTCCGACC GATGTCACCC GACCAACAGC GGTGCGCGCG CTGATCGACG CAGCGGTTGA AACGTTTGGA CGCCTCGATG TTGCATTCAA CAATGCTGGA TGGTTCGGAT CGGTCGCGCC GCTGGCAGAA CAGGACGAGC ATGAGTTCGA CCCTGTTTTT GACACAAATG TACGCGGCGT TTTTCTGTGC ATGAAGTACG AACTGGCACA GATGCTGAAA CAGGGGCAGG GTGTCATCAT TAACAATGCA TCGACGACCG GCATACGCAA TTCGACAATG GGTGTGGCGC TGTATGCCGC AGCAAAAGCG GCGGTGATTT CCCTGACACG CTCTGCCGCC ATAGAATACG CCGCTCATGG CGTGCGCATT AATTGCAGTA GCGCCAGGAC GGATTGCGAC CGACATGCTG GCGAAAGCAG GCGGCGGCAA CCCGGAACGT TTTGCTGCGG TGATTCCCAT GCGACGCCTT GGTACATCAG AAGAAGTCGC TCAGGCAGTT CTCTGGCTGG CATCATCGGC GGCGTCATTC GTCACCGGTC AGGTGCTGGG GGTCGATGGC GGATACCTGG CATCATAATA CACATAGAAG GAGACACAAC GATGCATGTC TGCGTCTACT GCGCCTCGAG CGATCACGTA CCGGCACTCT ATCTGGAAGC GGCCCATACC TTCGGTGAAG GGATGGCGCG GCGCGGCTGG ACGCTGGTGT ACGGCGGGGG TGGCATCGGA TTGATGGGCG CGGTCGCGCG AGCGGTACAC GGCGCCGGCG GACGGGTCAT TGGCGTTATT CCACAAACGC TGCTGGAACG CGAAGTCGGC TATCAGGAAG CGGACGAACT GATCGTCACC GGAACTCTAC GCGAACGCAA GCAGATCATG GATGATCGCG CCGATGCGTT CGTCGCGCTG CCAGGCGGCT TCGGCACACT CGAAGAATTG CTGGAAATCA TGACGCTTCG CATGCTTGGC TACCACAACA AGCCGATCGT CATCGTCAAC ATTGGCGGTT ATTTCGATCC GCTCCTGACA CAGTTCGAGT ATATCTTCAC CCAAAACTTT GCCCACGAAC GCTACCGCCG TCTCTACGCC GTGAAAGCCG ATCCTGAAAC AGCCCTCACC TATCTGGAGC GAATCACGCC TGGCTCGCCG TAA
|
Protein sequence | MFDERVVLIT GGGSGIGRAT ALAFGRHGAR VVVGNRDRDA GEATVAAIRA MGGTALFVPT DVTRPTAVRA LIDAAVETFG RLDVAFNNAG WFGSVAPLAE QDEHEFDPVF DTNVRGVFLC MKYELAQMLK QGQGVIINNA STTGIRNSTM GVALYAAAKA AVISLTRSAA IEYAAHGVRI NCSSARTDCD RHAGESRRRQ PGTFCCGDSH ATPWYIRRSR SGSSLAGIIG GVIRHRSGAG GRWRIPGIII HIEGDTTMHV CVYCASSDHV PALYLEAAHT FGEGMARRGW TLVYGGGGIG LMGAVARAVH GAGGRVIGVI PQTLLEREVG YQEADELIVT GTLRERKQIM DDRADAFVAL PGGFGTLEEL LEIMTLRMLG YHNKPIVIVN IGGYFDPLLT QFEYIFTQNF AHERYRRLYA VKADPETALT YLERITPGSP
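The relationship between the gene and protein sequence fields can be illustrated with a small translation sketch. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in the permitted start codons), and it strips the spaces that separate the 10-character blocks. The `translate` helper is hypothetical; only the first ten codons of the record are checked here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acids for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = dna.replace(" ", "").upper()  # remove the block-separating spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence field above.
prefix = "ATGTTCGATG AACGAGTCGT GCTCATCACC"
print(translate(prefix))  # MFDERVVLIT
```

The output matches the first block of the protein sequence field. Passing the full gene sequence to the same function would be the way to compare against the whole protein.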