Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_0819 |
Symbol | |
ID | 4032085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 901769 |
End bp | 904861 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637969345 |
Product | hypothetical protein |
Protein accession | YP_576155 |
Protein GI | 92116426 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02243] conserved hypothetical protein, phage tail-like region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.984927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGG GTGCGACATC CGCCACCTGC GCGGCCTGCG GCGCGCGCAT GGATTGCGGC TGCGAGTCCA ACTGCAATGG TGCGGAAACG CTTTCGCTCG GGGCGGGCGC GAACCGACCG GGCCTGGCGC GCATCGATAC GCGCATCGGC GGTCATGGCG CCTTCTTTGC GGCCGCGACA CGTGCGCTGT CATCTGCAGA CGCACCCGCG TTGCGCACGT TCGGAACCCG CGAGACGAAC GACCCTGCCA TCGCCCTTCT CGACGGCTGG GCGATGATCG CGGACGTCCT GACGTTTTAC CGCGACCGGT TCTCGAACGA GACTTATTTG CGCACGGCGC GCGAGGAACG GTCGCTGCGG GAACTCGCGG CGCAGGTCGG CTATGCTCCG CGCCCCGGCG TGTCGGCCAG CGTTGCCCTC GCCTATCTGC TCGATCCCGG CGCGCAGCCG GTCATGATTT CCGCAGGCGC CCGGGTCCAG TCGGTGCCAA AGCCAGGCGA GCAGATGCAG AGCTTCGAGA CGATACAGCC GCTTGAAGCT CGCGCCGAAT GGAGCGAGAT GGCGCCGCGA CAGAACTGGC TGCCGCCGAT CGATCGCATC GATGCGCTGC TGCGCAGCAC CTTGCAACTC GCCGGTGTGC AGACGGTCGC GCGTCCGGGC GAGCGGATGC TAATGGTGTT CGGATCGAAA CAAGGCTGGC AGGTGGTGCG CGAGGTCGCG GCCGTGAAGG TGAATATCGC CGCAGACTGC GTCCTTCTCA CACTGAAGCC GCGCCTCGGC CTCACCGCCA AACTCGCCGA CCGTCTGCTC GCCTATCTCG ACAAGGCCGT CGCGGTGTCC GGCGACCCGA GCACCGCCAG CGCGATCCGC GGCATCGCCT CCTATTTTCT CGGCGCCTCG GCGTACGACG CCAAACGCGT GGTGTCGGAC AGGGAGCGCG AGCTCTACGA GATTTTCGAT GAGATCGTGA AGGCGCCGCA CGCCGAGGTG TCGGGCCGGC GAGGCGCCAG CCTCGACGCC ATTCTCGCGG GGGCCAGGAC ACTCGCCGGG CGCGTCGCGG CGGGCGGACC TCGCGCCGTC ACGGATACGC TCGGTCCGGC GGGCCTAACC CGCACTGCGC TTCTCACGAG GGACTCACCC GCCGTCGGCG CCGCGCTCTA TTCGGCCTGG AGCAGGCTGC CCGTCAACGA CGTGACGGTG AGCCACGCGC CCGATTGCTA TCTGCTGCGC CTCGTCGCCG GCGCATTCGG CAGCAACGCG CCGGCGCTCA TGCCCGACGG CAAGGGAGAA GTGGATGACG TATCGATCGC GCCGATCGAC CGGGATCACG TCTTCCTCGA TGCGCTTGCG GACGGCGTGC AGAGCGGCGG CTTCGCCCTG ATGGACGCCC CGGCGACGAG CGAATCCGAC ACGCGACTTC TTCGCCTTGG TCGGGTGCGC GAGGTCCAGG CGACAGCGCG GTCCAACTAC GGGTTGACTG CGCGAGTCAC GCGGCTCGAC GTGGTGGGTC TCGAGAACGA CGAGCCGCTC GGCTTTCCCG AAATCATGAA CGAGCGTGGC GGCACGGTGC GGCTGTTGCG CCAGATCCTC TACGCCGTCC AGAGCGAGGC GGTGACGCTG GCGCCCGAGC CGCTCGACGA CCTCGTCGAG GGCGATACGA TCATGCTCGA CAGCCTCTAT CCGGATCTCG CGCCGGGCCG TGCGATCGTG GTCAAAGGCG AACGCGCCGA CATCATGGTG GGCGACAACG TCGTGCCGGG CGTGGCCGGA GGCGAACAGG CGAAGGTCGC CGCGGTCACG CAGGAAGCGA TCAAGGGATC GCCCGGCGAC ACGCCGCACA CGGTGCTGCG CCTCACCGCT CCGCTGACCT TCCGGTACCG GCGCGCGGCG ACGACCGTCC ATGGCAATGT CGTTATCGCC ACGCACGGCG AAACGGTGCA GGAAACACTG GGCTCCGGCG ATGCGAGCCA GGCGTTCCAG CGGTTCGCCT TGCGCCGCAA GCCCCTGACC TTCGTCGCGG CGCCGACGAC GAGCGGGGTC GCGGACACCT TGCGGCTGGA GGTCAATGGC ATCGCCCTAG GCCAGGTCGA CGCGCTGATC GACGCCGGTC CCAACGAGCA TGTCTATCAG CTCAGCGTCG ATGCCGGCGG CGCGGGCACC GCGGAGAGCG GTGACGGCAA GACCGGCATG CGCTTGCCGA CCGGCGCGGA GAATGTCCGC GCGACCTATC GCGTCGGCAT CGGCGCCGTC GGCAACGTCG ACCTCGGTCA GATCAGCCTC GCCACCACCC GTCCGCTTGG CGTGCGCGGC GTCGTCAACC CGTTGTCTGC GAGCGGCGGC GCCGATCGCG ACGGCGCCGA AGCGATCCGG CGCAATACGC CTGTCCCCAC CCTCGCCCTT TCGCCGCAGT CGCGGCTCGT CAGTGTCGAG GACTACGAAC ATTTCGCGCG CGCCTTTGCC GGCATCGGCG ATGTGCGCGC GGCAATGCTC TCCGACGGAC GCCACCGCAC CGTCTTCGTG ACAATCGCCG GCATCGACGA CGCTCCAATC GCGCCCGACG ACCTGCTCAC CGGCACGCTG ATCGACGCCT ATGCGCGCTA TGGCGATCCG GCGCTGCCGG TGGTCGTCGC GGTCCGCGAA CGCGTGACCC TGCTCGTGCA GGCGAATGTC GCGTTCGCGC CGGAGATGGA CCGAGTCGCC GTCGACGCGG CCGTGCGCGC GCTGCTGGCC GAGGCCTTCT CGTTTGCGCG GCGGGGCCTT GCGCGCCCTG CTTATCGCTC GGAGCTGATC GCGCTCATCC AGGGCTCGCC GGGCGTCGAT CATGTCGATG TCGATGTCTT CGGCGGCATC TCCGACGCGG TCCTTCAGGA CCCGGTCCGC CTTGCTAATG CGGTGCACGA CCTGCGCGAG CAGGCGCAAG AGGGGCGTCC GCTGGTCTTC GTCCCCGCCG CGCCCGCGCG CGTCGCGCCG CTGGCGCATT CAGGATTGCA TATTCAAGGG CCGGAAAGCG CCTCATCGGA GCGTCTGCTC CCAGCGCAGA CGACCTATTT GCGGCCGGAC GTTCCCGGCA CGCTGATCCT CAACTGGTCG TGA
|
Protein sequence | MSAGATSATC AACGARMDCG CESNCNGAET LSLGAGANRP GLARIDTRIG GHGAFFAAAT RALSSADAPA LRTFGTRETN DPAIALLDGW AMIADVLTFY RDRFSNETYL RTAREERSLR ELAAQVGYAP RPGVSASVAL AYLLDPGAQP VMISAGARVQ SVPKPGEQMQ SFETIQPLEA RAEWSEMAPR QNWLPPIDRI DALLRSTLQL AGVQTVARPG ERMLMVFGSK QGWQVVREVA AVKVNIAADC VLLTLKPRLG LTAKLADRLL AYLDKAVAVS GDPSTASAIR GIASYFLGAS AYDAKRVVSD RERELYEIFD EIVKAPHAEV SGRRGASLDA ILAGARTLAG RVAAGGPRAV TDTLGPAGLT RTALLTRDSP AVGAALYSAW SRLPVNDVTV SHAPDCYLLR LVAGAFGSNA PALMPDGKGE VDDVSIAPID RDHVFLDALA DGVQSGGFAL MDAPATSESD TRLLRLGRVR EVQATARSNY GLTARVTRLD VVGLENDEPL GFPEIMNERG GTVRLLRQIL YAVQSEAVTL APEPLDDLVE GDTIMLDSLY PDLAPGRAIV VKGERADIMV GDNVVPGVAG GEQAKVAAVT QEAIKGSPGD TPHTVLRLTA PLTFRYRRAA TTVHGNVVIA THGETVQETL GSGDASQAFQ RFALRRKPLT FVAAPTTSGV ADTLRLEVNG IALGQVDALI DAGPNEHVYQ LSVDAGGAGT AESGDGKTGM RLPTGAENVR ATYRVGIGAV GNVDLGQISL ATTRPLGVRG VVNPLSASGG ADRDGAEAIR RNTPVPTLAL SPQSRLVSVE DYEHFARAFA GIGDVRAAML SDGRHRTVFV TIAGIDDAPI APDDLLTGTL IDAYARYGDP ALPVVVAVRE RVTLLVQANV AFAPEMDRVA VDAAVRALLA EAFSFARRGL ARPAYRSELI ALIQGSPGVD HVDVDVFGGI SDAVLQDPVR LANAVHDLRE QAQEGRPLVF VPAAPARVAP LAHSGLHIQG PESASSERLL PAQTTYLRPD VPGTLILNWS
|
| |