Gene Information |
Locus tag | Nham_3126 |
Symbol | |
ID | 4032312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 3446159 |
End bp | 3449110 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637971540 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_578322 |
Protein GI | 92118593 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
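The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names are hypothetical and are not part of the source record.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    gene_len = feature_length(3446159, 3449110)
    print(gene_len)                           # 2952, matches "Gene Length"
    print(expected_protein_length(gene_len))  # 983, matches "Protein Length"
```

Under these assumptions the Start bp, End bp, Gene Length and Protein Length fields of this record agree with one another.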
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.389263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGATCA AGAGATCACA GCGACAGAAT GCGCAAATCG CGCGCGGCGG TTCGGCTCTC GGAACGCTGA CCGAGCCGGG AAAAGGCTTC GACCGCCGCA CGTTCCTGCG CCGTTCCGGT CTGGCCGCCG GCGGTCTTGC CGCGTTGAGC ACGCTGCCGC TCGGCACCGT CCGCAAGGCC GAAGCGGCGG CTGCGGGGCC GCTGACGTCG GGCGCGACGG TTCGTAAAAG CGTGTGCACG CATTGCGCCG TCGGCTGCAC CGTCACCGCC GAGGTGCTGA ACGGTGTCTG GATCGGCCAG GAGCCGAGCT GGGATTCGCC GATCAATCGC GGCTCGCACT GCGCCAAGGG CGCATCGGTG CGCGAACTCG TCCATAACGA GCGCCGCCTC CGCTACCCGA TGAAACTTGT CAACGGACAA TGGACGCGCG TTTCGTGGGA TACGGCCATC GACGAGATCG GCGACAAGTT AGTGCAGGTC CGCCAGAAGT CGGGCGGCGA CTCGGTCTAT TGGCTCGGTT CGGCGAAGAT GACGAACGAA GGGTCGTATC TGTTCCGCAA GCTCGGTGCG TTCTGGGGCA CCAACAACAC CGATCACCAG GCGCGCATCT GCCATTCGAC AACTGTGACC GGCGTGGCCA ACACCTGGGG CTACGGCGCG ATGACCAACA GCTTCAACGA TATTCGTAAC TCCAAAACGC AGATCATCAT GGGCGGCAAT CCCGCCGAAG CGCATCCGGT TTCGCTACAA CATCTTCTTG AAGGCGCCGA ATTGAACAAG GCGAATGTCG TCGTGATCGA TCCGCGCATG ACGCGGACCG CCGCACATGC GACCGAGTAC GTGCGGCTGC GGCCGGGAAC AGACATTCCG GTTCTGTACG GAATGATGTG GCACATCCTC AAGAATGGCT GGGAAGACAA GGAATTCATC CGGCAACGCG TTTACGGTTT CGACGATCTT CGCAAGGAAG CGGAGAAGTG GAATCCGGAG GAAGTCGAAC GCGTCAGTGG CGTTCCGGGC GCGCAGCTTG AGCGCGTCGC CAAGATGTTC GCGACGGAAA AGCCGGCGAC GTTGATTTGG TGCATGGGGC AGACCCAGCA TACGGTCGGC ACCGCGAATG TGCGCGCGAG TTGCATCGCC TTGCTGCTGA CCGGCAATGT CGGCAAACCC GGCACCGGCG CTAACATCTT CCGCGGCCAT GACAACGTGC AGGGCGCGAC CGACGTCGGG CTCGATATCG TGACGCTGCC TTTCTACTAC GGCCTGGCCG AAGGCGCCTG GAAGCACTGG TCGCGTGTCT GGGAGGTCGA ATACGACTAT CTGGTGTCAC GTTTCGACGA CAAGAAATCG ATGGAAACGC CCGGCATTCC GCTGACGCGG TGGTTCGACG CCGTTATCCT TCCAAAAGCC GACGTGGCCC AGAAAGACAA TGTGAAGGCG GTGTTCGTGC AGGGACACGC CAGCAACAGC ATTACGCGAA TCCCCGAATC GATGAAGGGA CTAAAGGCGC TGGAATTGCT CGTCATCGCC GACCCGCATC CGACCACATG GGCTTCGCTC GCGGTACAAG CCGGCCGCAA GGACGGTGTT TATCTTCTGC CTGTCGCCAC GCAGTTCGAA TGCAAGGGCT CGCGCGTCGC CTCGAATCGC TCGCTGCAAT GGGGTGAGCA GATCGTGAAG CCGGTCTTCG AGTCGAAGGA CGACCTCGAG GTCATGTATC TTGTCGCCAA AAAGCTCGGC TTTGCAGACA AGCTGTTCAA GAACATCAAG GTCGAGGACA ACCTGCCGGT GGCGGAGGAT 
ATCCTTCGCG AGATGAATCG CGGGAGCTGG TCGACCGGCT ATTGCGGCCA ATCGCCGGAA CGCCTCAAGG CGCACATGAA AAACCAGAAC AAGTTCGATC TGGTCACCAT GCGCGCGCCA AAGGACGATC CGGAAGTCGG TGGGGACTAT TACGGCTTGC CGTGGCCGTG CTGGGGTTCG CCGGAGGTGC GGCATCCGGG CACGCCCCTG CTCTACAACA CCAATCTTGC GGTCATGGAC GGCGGCGGTT GCTTCCGCCC GCGTTTCGGA CTCGAGCGCG AGGAGAAGCT GCCCGATGGC AGCACCCGCA AGGTCAGCCT GCTCGCGGAC GGTTCCTATT CGAAGGATTC GGAAATCAAG GACGGCTATC CCGAATTCAC GCTGGCGAGC TTGAAGAAGC TTGGCTGGGA CAAGGATCTG ACCGAGAGCG AGATGGCCAC GATCAACAAG GTCAATCCCG ACAAACCGGA TACGGTGTCC TGGGCGCTCG ATCTCTCGGG CGGTATTCAG CGCGTCGCGC TGATGCACGG CTGCGTTCCC TACGGCAACG GCAAGGCGCG CATGAACGCC TTCGGTCTGC CGGATCCGAT CCCGGTGCAT CGCGAACCCA TTTACTCTCC GCGTGTCGAT CTGGTGTCGA AATATCCGAC GTTGCCGGAC GCCAAGCAGT TCCGCTTACC CAACATCGGC TTCTCCGTGC AGAAGGCCGC GGTCGAGAAA GGAATCGCCA AGCAGTTTCC GCTCGTCCTC TCCTCAGGCC GCCTGGTCGA ATATGAGGGC GGCGGCGAGG AGTCGCGGAG CAATCCGTGG CTCGCCGAAT TGCAGCAGGA CATGTTCATC GAGATCAGCG TCGCGGACGC CGCCGAACGC GGCATCAAGG ACGGCGGCTG GGTCTGGGTG ACGGGCGCCG AGAACAGCTC GAAAGCGAGG ATGAAGGCGC TGGTGACCGA GCGGGTCGGC AAGGGCGTGG CGTGGATGCC GTTCCACTTC GGCGGATGGT TCGGAGGCGT CGATTTGCGC AACAACTACC CGAAGGGGAC CGATCCGATC GTGCTGGGCG AGAGTGCCAA TACGATCACG ACCTACGGCT ACGATCCTGC AACGGGCATG CAGGAACCGA AGGTCACGCT CTGTCAGATC GTCGCGGCAT AA
|
Protein sequence | MLIKRSQRQN AQIARGGSAL GTLTEPGKGF DRRTFLRRSG LAAGGLAALS TLPLGTVRKA EAAAAGPLTS GATVRKSVCT HCAVGCTVTA EVLNGVWIGQ EPSWDSPINR GSHCAKGASV RELVHNERRL RYPMKLVNGQ WTRVSWDTAI DEIGDKLVQV RQKSGGDSVY WLGSAKMTNE GSYLFRKLGA FWGTNNTDHQ ARICHSTTVT GVANTWGYGA MTNSFNDIRN SKTQIIMGGN PAEAHPVSLQ HLLEGAELNK ANVVVIDPRM TRTAAHATEY VRLRPGTDIP VLYGMMWHIL KNGWEDKEFI RQRVYGFDDL RKEAEKWNPE EVERVSGVPG AQLERVAKMF ATEKPATLIW CMGQTQHTVG TANVRASCIA LLLTGNVGKP GTGANIFRGH DNVQGATDVG LDIVTLPFYY GLAEGAWKHW SRVWEVEYDY LVSRFDDKKS METPGIPLTR WFDAVILPKA DVAQKDNVKA VFVQGHASNS ITRIPESMKG LKALELLVIA DPHPTTWASL AVQAGRKDGV YLLPVATQFE CKGSRVASNR SLQWGEQIVK PVFESKDDLE VMYLVAKKLG FADKLFKNIK VEDNLPVAED ILREMNRGSW STGYCGQSPE RLKAHMKNQN KFDLVTMRAP KDDPEVGGDY YGLPWPCWGS PEVRHPGTPL LYNTNLAVMD GGGCFRPRFG LEREEKLPDG STRKVSLLAD GSYSKDSEIK DGYPEFTLAS LKKLGWDKDL TESEMATINK VNPDKPDTVS WALDLSGGIQ RVALMHGCVP YGNGKARMNA FGLPDPIPVH REPIYSPRVD LVSKYPTLPD AKQFRLPNIG FSVQKAAVEK GIAKQFPLVL SSGRLVEYEG GGEESRSNPW LAELQQDMFI EISVADAAER GIKDGGWVWV TGAENSSKAR MKALVTERVG KGVAWMPFHF GGWFGGVDLR NNYPKGTDPI VLGESANTIT TYGYDPATGM QEPKVTLCQI VAA
|
| |
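The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG is one of the permitted initiation codons. The following is a minimal sketch of that translation step, assuming the NCBI table 11 amino acid assignments and start codon set as reproduced in the code. The function name `translate_cds` is hypothetical, and the sketch does not handle ambiguous bases.

```python
from itertools import product

# NCBI translation table 11 (Bacterial, Archaeal and Plant Plastid),
# with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; at position 0 they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is translated as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGCTGATCA AGAGATCACA GCGACAGAAT"))  # MLIKRSQRQN
```

Applied to the first 30 bases of the gene sequence, the sketch reproduces the first ten residues of the listed protein sequence.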