Gene Information |
Locus tag | Rcas_2388 |
Symbol | |
ID | 5539869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3076177 |
End bp | 3077223 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894520 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001432488 |
Protein GI | 156742359 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
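The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length (the function names are illustrative and not part of the source database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3076177, 3077223)
print(length, protein_length(length))  # 1047 348
```

Because the gene is unspliced, the length follows directly from the two coordinates; the strand only affects orientation, not length.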
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.689816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGCATC GCCTGGTTAT CATACTGATG ATCATAGTTG CACTGGCGGC GGCTGGATGC GGTGCAACGC CAGCCGCAAC GCCGACTATG CCGGCCGCTG CGCCGCCGCC AACCGAATCG GCGACCCTGC GCCCAATAGT GATGGGATTC CCGTATATTC CGAATGTGCA ATTCGCCCAT TTCTACCTGG CGGATGCGAA AGGGTACTAT GAAGCCGAAG GATTGGACGT CGCCTTCGAT TACAATTTTG AGACCGATGT GGTGCAGCGC GTGGCGCAGG GAACATTGCA GTTCGCGCTG GCGTCGGGCG ATTCGGTGCT GCTGGCGCGT TCGCAAGGTT TGCCGATTGT CACAGTGATG ACGAATAGCC AGCGCTTCCC GACGGTGCTT TTCAGCAAAG CGGAAGCGAA CATCACTACG CCAAAGGACC TGACGCGCGA CGGGGTGACG GTTGGCATTC CAGGGCGCTT CGGCGCCAGC TGGATCGGTT TGCTGGCGTT GCTCTACGCT GAGAACATCC CGCGAGAAGC GGTCAACGTT CAAGAGATCG GTTTCACGCA GGTGGCGGCG ATCACCGAGG GGAAAGTGAC GGTTGCAACC GGGTACGGCA ACAACGAGCC GATTCAACTG GAGCGGCAGG GCATTCCGGT GAATGTCATC CGTATCGCCG ATTATTTCCC GCTGGCATCC GACGGGCTGA TTACCGGTGA GCAACTCGTT GCCGGCGATC CCGACGTGGT GCGCAAGTTC GTGCGGGCAA CCCTGCGTGG CATGGCGGAT GTGATCGCCG ACCCTGACGC TGCATTCACC ACTGCTCTCG ATTACATCCC CGAACTCAAG GGCGCCGATC AATCGACGCA GGACCTTCAG CGCGCCGTGC TCCAGGCGAC GCTCGACTAC TGGCAGAGCG ACAAAACGAA GACCGAGGGG CTGGGGTTCT GCGATGAAAC GAACTGGCGC GAAACCTACG TCTTCCTGCG TGAGAGCGGT CTGCTGGCGA CCGATGTGGA CGTAACGAAG GCATTTACCA ATCAGTTCAT CAAGTAG
Protein sequence | MMHRLVIILM IIVALAAAGC GATPAATPTM PAAAPPPTES ATLRPIVMGF PYIPNVQFAH FYLADAKGYY EAEGLDVAFD YNFETDVVQR VAQGTLQFAL ASGDSVLLAR SQGLPIVTVM TNSQRFPTVL FSKAEANITT PKDLTRDGVT VGIPGRFGAS WIGLLALLYA ENIPREAVNV QEIGFTQVAA ITEGKVTVAT GYGNNEPIQL ERQGIPVNVI RIADYFPLAS DGLITGEQLV AGDPDVVRKF VRATLRGMAD VIADPDAAFT TALDYIPELK GADQSTQDLQ RAVLQATLDY WQSDKTKTEG LGFCDETNWR ETYVFLRESG LLATDVDVTK AFTNQFIK
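The gene sequence is shown in coding orientation (it begins with ATG and ends with the stop codon TAG), so it can be translated directly even though the gene lies on the minus strand. A minimal sketch, assuming that translation table 11 assigns amino acids to codons in the same way as the standard genetic code and differs only in which start codons are permitted; `CODON_TABLE` and `translate` are illustrative names, and only the first 30 and last 18 bases of the gene are used here:

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATGCATC GCCTGGTTAT CATACTGATG"))  # MMHRLVIILM
print(translate("AATCAGTTCATCAAGTAG"))               # NQFIK
```

The two outputs match the first ten and last five residues of the protein sequence listed above.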