Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_0801 |
Symbol | |
ID | 4032358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 884010 |
End bp | 885410 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637969329 |
Product | Phage uncharacterized protein-like |
Protein accession | YP_576139 |
Protein GI | 92116410 |
COG category | [S] Function unknown |
COG ID | [COG5410] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.814229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATG AGATCGACCT CTTCAATGCG ATCCTGCGAA GCAACTTTGC CAGCTTCGTG CATCGTTCCG TCCGAACGTT AAATCCCGGA TCGGCCTTTA TTCCGAACTA CCACATTGAC GCGATTGCTT ATCAGCTTGA GCGGATCCGG CGAGGCGACA TCAACAGGCT CATCATTAAT CTGCCGCCTC GCTATCTCAA ATCGATCATA ACATCAGTGG CGTTTCCCGC TTTTCTTTTA GGCCACGATC CGAGGCGGCG GATCATTTGC TTAAGTTACG GCGCCGAATT GGCGGCAAAG CATGCGGCCG ATTGTCGGGC CATCATGATG TCGAATTGGT ATCGCAGAGC ATTCCCGCAA ATGATCGTCT CGCGGACTGC TGACTCCAAC ATCTACACCT CCGGGCAGGG CTTCCGTAAA ACCAGCTCAG TCGGGGGTGC CCTGACAGGT CTGGGAGGAG ACCTGTTTAT CATCGATGAC CCTCAAAAGC CTGTGGATGC ACAATCCCAG CCTTTGCGAG ATCAATTAAA CCAGTGGTTT TCCAATACGC TCGTGTCTCG CCTGGATAAC AAGGAAAAGA GCGCCATCAT TGTTGTCATG CAGCGCGTCC ATCTTGGCGA CCTGACCGGC TACCTCACCG AGAGCTCCGA TCGGTGGACT GTCCTCAGTC TTCCGGCCAT CGCTGAAGTC GACGAGCGCA TTCAAATCGG CGATTCGCGA TACTACGAGC GTCACGCCGG TGAAGCGCTG CACGAGAAAT ACGAATCCCT GGCCACTTTG GAGGATCTAC GTCGGGAAAT GGGGACGGAA GCGTTCAGTG CGCAGGATCA ACAAGCCCCT GTACCGCCGG GCGGCGCGAT GATCCAGCGG CGATGGCTTC GTTACTACGA TGTCCCTCCC GAAGGCTCGA GCAGCGCACG GATTATTCAG AGTTGGGATT GCGCGGGCAA GGAGGGCGCG CAAAATAGCT GGTCCGTGTG CACCACCTGG CTGGTCGATA AGGGTAAATA CTATTTATTG GACGTCACGC GAGGTCGCTT TGACTATCCG AAGCTGCGTC AGTCGACGCT CGCTTTAGCC GAAAGATACA ATCCGACAGC CATTCTTATC GAAGACGCGT CCGCAGGCAT CGCGCTCGCT CAGGAACTCC GACAGGCAGG GCGCTTCCGC GTTCAACCGA TTCCGGTCGA TCGTGACAAG GTCACGCGCC TCTACGTTCA GGCGGCCAAG TTCGAAGCTG GCCACGTTTA TTTCCCCAAG CAAGCGCCTT TCCTGGCTGA CCTCGAGGCC GAGTTGCTTA CATTTCCGCA AGGGAAGCAT GATGACCAGG TCGACAGCTT GACGCAGGCA CTCGCGTTCA ATGCTTCCAG ATATGATGAG ACGCTTTCCT GGGTCGGGTA A
|
Protein sequence | MKNEIDLFNA ILRSNFASFV HRSVRTLNPG SAFIPNYHID AIAYQLERIR RGDINRLIIN LPPRYLKSII TSVAFPAFLL GHDPRRRIIC LSYGAELAAK HAADCRAIMM SNWYRRAFPQ MIVSRTADSN IYTSGQGFRK TSSVGGALTG LGGDLFIIDD PQKPVDAQSQ PLRDQLNQWF SNTLVSRLDN KEKSAIIVVM QRVHLGDLTG YLTESSDRWT VLSLPAIAEV DERIQIGDSR YYERHAGEAL HEKYESLATL EDLRREMGTE AFSAQDQQAP VPPGGAMIQR RWLRYYDVPP EGSSSARIIQ SWDCAGKEGA QNSWSVCTTW LVDKGKYYLL DVTRGRFDYP KLRQSTLALA ERYNPTAILI EDASAGIALA QELRQAGRFR VQPIPVDRDK VTRLYVQAAK FEAGHVYFPK QAPFLADLEA ELLTFPQGKH DDQVDSLTQA LAFNASRYDE TLSWVG
|
| |