Gene Information |
Locus tag | Rcas_0840 |
Symbol | |
ID | 5538306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1099847 |
End bp | 1100647 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640892992 |
Product | histidine triad (HIT) protein |
Protein accession | YP_001430975 |
Protein GI | 156740846 |
COG category | [F] Nucleotide transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases |
TIGRFAM ID | |
|
|
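The length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive coordinates: 1100647 - 1099847 + 1 = 801 bp, and 801 / 3 = 267 codons, of which the last is the stop codon, leaving 266 aa. The sketch below reproduces that arithmetic. The function names are hypothetical and not part of any database API, and the inclusive-coordinate convention is an assumption inferred from the numbers in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the terminal stop codon encodes no residue


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1099847, 1100647)
print(length, protein_length_aa(length))  # 801 266
```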
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.263994 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000000646753 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATCTCA TCTTTCGTTC AGTTCATAAC CCTCATAGCC ATCTTACTGT CAAAGATCGT ACTCAGGCAT CCTTCCCGCT CAGGAATCTG CTCTCGCGAG GGCTGATCAA CGGACGAGTG CTTGATTTTG GATGCGGATT GGGAGCAGAT GTCAATCACT TAAAGAAGCA AGGCCACGAT GTTACTGGTT ATGATCCTTA CTATGCGCCC AATGTCCCAA AGGGAAAGTT CGATACAATT ATATGCCTGT ATGTCTTGAA TGTGTTGCTT CCAGATGAGC AGTCTCATGT GTTGATGGCT ATCTCAGAAT TGCTATATCC AACTGGAAGC GCGTACTTTG CCGTTCGCCG CGATATTCGG CGTGACGGTT TTCGCAATCA CGTCAAATAT CACAAGAACG TTTATCAGTG CCAAGTTACA TTGCCCTATA AAAGCATCTT GCGCACCGAT CATTGTGAAA TCTACCGGTA TCGTCATTTC AATCAGTTGC AGGTAAATGA GTCCTGTTCT TTCTGTGCTC CAGCAAGTAA TTGTGAGTTA TTGACCGAGT CAGCTAATGC CTATGCTGTT TTGGAAAATT CCTCCTCTCT ACCTGGTTAT ACCCTCGTGA TCCCCAAGCG GCACGTCAGT TACTTCGATC TTTCACTTTA TGACAGGAAT GCCTGCTGGC AAGTAGTAGA TCGCGTGAAG ATGCTGCTCA GCGAGCGTTT TCATCCTGAT GGTTTCAGGG TGAAGGTTGG TCAAGGCATA GCCACAGAAT GTGCTGGTTG GCACGGGTGT ATCCATGTAA TCCCCTGTTA G
|
Protein sequence | MNLIFRSVHN PHSHLTVKDR TQASFPLRNL LSRGLINGRV LDFGCGLGAD VNHLKKQGHD VTGYDPYYAP NVPKGKFDTI ICLYVLNVLL PDEQSHVLMA ISELLYPTGS AYFAVRRDIR RDGFRNHVKY HKNVYQCQVT LPYKSILRTD HCEIYRYRHF NQLQVNESCS FCAPASNCEL LTESANAYAV LENSSSLPGY TLVIPKRHVS YFDLSLYDRN ACWQVVDRVK MLLSERFHPD GFRVKVGQGI ATECAGWHGC IHVIPC
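The two sequence fields can be cross-checked against the Gene Length, Protein Length and GC content fields. The following is a minimal sketch using only the Python standard library, not the method the database itself uses. All names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the Gene sequence field in this record.
first_blocks = "ATGAATCTCA TCTTTCGTTC AGTTCATAAC"
print(translate(first_blocks))  # MNLIFRSVHN
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field gives an end-to-end consistency check, and `gc_fraction` on the same input can be compared with the GC content field.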