Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1226 |
Symbol | |
ID | 5538693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1584336 |
End bp | 1586153 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640893359 |
Product | 5'-nucleotidase domain-containing protein |
Protein accession | YP_001431341 |
Protein GI | 156741212 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATGC CCACGATCTC GCGGCGCACG TTTGTGAAGA CGATGGCGGT TGGATCGGGA ACAATCGCCT ACCTGGCGAC GGCGCTGACG GTGCACGGCG CCGGTCCAGA GGTCTACACG CTGCGCATTG TCCACACGAA CGATCATCAC GCACGCATCG AGCCGGTGTT CAGTGGCGCC AACCCCGTCC ATGGCGGCGT TTCGCGGCGT AAGACGCTGA TCGATGCCAT TCGAAACGAA GGCGGCAATC AGTTGCTACT CGACGCTGGC GATGTGTTTC AGGGCACGCT CTACTTCAAC CAGTACCGCG GGCTGGCAGA CCTTGAGTTC TACAATGCAC TCAAGTACGA TGCCATGGCG ATTGGCAACC ACGAGTTCGA CATCGGGCAG GCGCCGCTGG CGGATTTTGC GCGCGGCGCA ACTTTCCCGC TGCTCAGCGC CAATATTCAG GTCGATCGCT CGTCGCCGCT CTTTGGTCTC ATTAAGCCGT GGGTCGTCGT CTGGGTCGGT GGTCAACCCA TCGGTATCAT TGGCGTGACC ACCGAAGACA CGCCGGTGCT CAGCAATTCC GGTCCTGGCG TCAGGTTCAC CAACTATATC GATGCAGTGC GCCTGGGGGT TGAGTCGCTG CGCCGCGATG GAGTCAACAA GATCATTGCG CTGACCCACG TCGGCATTCA GGCAGACCGT GAACTGGCGC GGCGTGTCGA TGGTTTGTCG GTCATCATTG GCGGGCACAG CCACACGCCG ATGGGTCCGA TGGTCAATCC GCAATCACCT GATCGACCCT ACCCCGAAGT CATTGCCTCA CCCTCGCGCA AGCCGGTGAT CGTGGCGCAC GATTGGGAGT GGGGGCGCTG GCTTGGCGAC CTGACGATTG GCTTCGACGC CAACGGCGAC ATTACGCGCG TGGTTGCAGG GCGCCCCACC GAAGTGTTGC CCGCGATCAA TCCTGATGGC GGTTTCGAGA ACCGGATCAG AACCTTCAAA GGTCCTCTGG ATCAACTGCG CGCGACACCA GTTGGCGAAG CGCGCGTGGC GCTCAATGGC GCTCGCGCCG ATGTCCGCTC GAAGGAAACC AACCTGGGGA ACCTGATCGC CGATTCGATG CTGGCGAAGA CGGCGCCGGC CGGCGCGCAG TTGGCCATTA TGAACGGCGG TGGTATCCGC ACCAGCATAC CTGAAGGACG CATCACCCTT GGTCAGGTGC TCGAAGTCAT GCCATTCGGC AACACCCTTG TGCTGCTGAC CCTTACCGGC GATCAGGTCA AGGCAGCGCT GGAGAATGGC GTCAGTCAGG TGGAACAGTC CGCCGGGCGC TTTCCGCAGG TCAGCGGTAT GCGTTATAGT TGGAACGCTT CGGCGCCAGC CGGGAGCCGC ATTACCGGCA TTCAGGTCTC TGATGGAAGA GGCGGGTTTG TGGCTATCAA TCCGAACGCG ACATACCGCG TGGTCGTCAA CAACTTTATC GCTGGCGGCG GAGACGGCTA CAGTGTGTTG CAGCAGGGAA CGAACAGGGT GGACACCGGC TTTCTCGATG CCGATGTGCT GGTGGAATAC CTCCAGGCGC GTTCGCCCGT CAGTCCGCAG GTCGAAGGGC GCATCGTGCA GAATGGCACG CTGCCAGGCG CAGCCGCGTC AGCCCCGGCG CCAGCCGAAA TGCCGGTGGC GTTGCCGCGC ACCGGCGGCG AGTCGTTGCC CGCGTGGTTG CTGGCTCTGG CGGCAGCCGG AGCGATTGGC GGCGGTCTAC GGCTGCGTGA GCGCGCTGCG CGCATGGCAA CCGCCGATGA ACACGAACCC GTCACCGTCA ACCAGTAG
|
Protein sequence | MSMPTISRRT FVKTMAVGSG TIAYLATALT VHGAGPEVYT LRIVHTNDHH ARIEPVFSGA NPVHGGVSRR KTLIDAIRNE GGNQLLLDAG DVFQGTLYFN QYRGLADLEF YNALKYDAMA IGNHEFDIGQ APLADFARGA TFPLLSANIQ VDRSSPLFGL IKPWVVVWVG GQPIGIIGVT TEDTPVLSNS GPGVRFTNYI DAVRLGVESL RRDGVNKIIA LTHVGIQADR ELARRVDGLS VIIGGHSHTP MGPMVNPQSP DRPYPEVIAS PSRKPVIVAH DWEWGRWLGD LTIGFDANGD ITRVVAGRPT EVLPAINPDG GFENRIRTFK GPLDQLRATP VGEARVALNG ARADVRSKET NLGNLIADSM LAKTAPAGAQ LAIMNGGGIR TSIPEGRITL GQVLEVMPFG NTLVLLTLTG DQVKAALENG VSQVEQSAGR FPQVSGMRYS WNASAPAGSR ITGIQVSDGR GGFVAINPNA TYRVVVNNFI AGGGDGYSVL QQGTNRVDTG FLDADVLVEY LQARSPVSPQ VEGRIVQNGT LPGAAASAPA PAEMPVALPR TGGESLPAWL LALAAAGAIG GGLRLRERAA RMATADEHEP VTVNQ
|
| |