Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1876 |
Symbol | |
ID | 3705450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2132189 |
End bp | 2135095 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637738355 |
Product | hypothetical protein |
Protein accession | YP_343872 |
Protein GI | 77165347 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAAT CTCTCGCAAC GAAAGCACCA AAAGGCAAAA GCGCCGAGCT AGCACTGAAG CGTCCGGCTG AGCATGCTCG CCAGCCTGCT GCCTCAACAG GTCCCTATGG CGAGGTACTG GCCTTGCAGC GCAGTGCGGG CAATCGCGCG GTCAGTCAGT TGCTTGAATC AGGAATGAGC GCATCACCAT TCTCCGGCGC TGTGCTCCAG CGCAAGTGTG CGTCGTGCGC CAACTCTGGT AGCGAATGCG CGGAATGCCG CAAGAAGCGC ACCTTCGCTC TGCAACCTAA GCTCACTATT AACGAACCGG GAGACCGGTA CGAGCAGGAA GCCGATCGCA TTGCCGCTCA GGTGATGACT ATGCCGGCAC AGCAGGCGGC CAGTAGCGCT CCTCCCCGTA TCCAGCGCCT CTCTGGAGAG GCGGTAGCGG TAGGCCGACC CAGTGTAGTA CCCGATAGCG TGCACCAAAC GCTTGCTAGT CCTGGTAGAC CGCTGGAGTC GACACTACGG CAGGATATGG GGCAGCGCTT TGGCTATGAT TTTTCTCGGG TGCGAGTGCA TACCAGCAGG GCTGCCGAGC AGTCGGCTCG CGACGTAAAC GCCCATGCTT ACACGGTGGG CCAAAACATC GTGTTCGGCG CCGGCCAGTT CGCACCGAGA ACACAGGAGG GACGACGGTT GTTGGCCCAT GAGTTGGCTC ATGTGGTGCA GCAGTCAGAT GCAGGCGAGC TGCATGCTGG CAAAAGTAGC AAGAAATGTA GTCTATCCCT GTCCGCTGAT TGTTCCGTCG CACGCAAACC GCGCACCGGT GCGACAGGCG CCTTGCCTTC GGTGGTGCAG GCCATGGCTC GGGAAGAGGC GCGTTCGGTG CTCGTAGCGT ACGTCACAGT GGCGGGAGTA GATGACGCAC TCGCCGCCAT GAACGCCATT CAAGAGACGC TGGACATGCC GTTCACCATG GAAAACGCGA GCATGCGCCT GCAACTCCTG ACCGCCGCAT TCAGCTTACT CGATGAGGAG GACGCTGCAA TCGTGCTTAA GGCACTGACT AAGCCCGTAG GCGCCGAGCA AAAACACTTG CACGAACGTT TCGGTAGGCT CGATTCGGAT TTCCGTAGAT TGTTGCTCGA CATCTTGCGC GAGCGGGCCG CTGCGAAGCC CGCGCCTGAG CCCGAGCAAG CAGAAGAGCC GGCTGCGGTC GCACCCACAG CGACATGGGT TGAGCTGCAC TCAGGCGTAT TCGCATACGT GCCAAACCGC GGAAAGACCC TCAAGCATGT CGCAGCATAC GTCTCGGGTC ACCCCAATGT GCCCGAAGCG CTCGGCCAGT TGAATGACCT TCCGCTAACG ACACCGATCC CGGAAGGGCA GTCGGTCATC ATTCCGATCG AATTTATTGA TCGCCCGAAG GCATTCCAGG AGATGCCCGA GGCTGTGCGG AGCCGCATCA TTTCGATGCG CAAGGCGCGG GCGCAGCAGG AGCGATATCT GCGATTCGTG CAGGTGAAGA GCGGGCACCC GCTGGGGCCT GGGGCCTCTG GACTATTCCC CGTCACCATG GCGCTCACCG AGGCGGCCAT AGAATCAATC GTCAGCGCAC TGAAGTCCCT GATCGAGAAG GTGGGATACG CGATCGCCTT CGCTGGTGGT GTGATCCATG GTTTCCTGAA GTCGATCTGG GATGCGGTTT CCGGCATCGC GAAGCTCATC TATGACGTCT TGAAGAGCAT CATTTCCCTC GAGCTTATCT CGGACGTGAA GAAGCTCGTG AGCTCGATCA AGAAGCTGAG CTGGGACAAG ATCACGGACG CGGTCGGTGA GTGGGCCGCC GGCTGGGTCG AAAAGCTCCA GTCCAAAAAC CCACTGGTCG CTGGGCATGC GCATGGCTAT CTAACTGGCT ATGTCATGGC CGAAGCGGCC ATGTTTTTGC TGACCGGCGG TCAAATAGCT GCGCTCAAGG GATCTATTTG GACCTCTAAG CTCGGCCAGG TGGTGAAGAC CTCTCGCGTC TTCAAGACTC TCGAAACTGC TGTTGCGAAG GCGAAGATCC TCCGCGCGGG GAGCCCTAAA TTCAACAAGG CCGTCGACAC GCTCAGGCAA TCGCGCCTTG GAACAGCCAT CAAGGCCGCG GAAGTGACCG GCGCCGCGGT CGTGTGGACC GCCGATAAAG TGGCAGCGGT GCTGAGGCTT CCCAGTAGTA TCGCCGGCTA CGTCGTCGAG AAGGCGGTGG CCCATGCCAA GCAACTGGAG CCTTTCTTCG AACGCATCGG CGAGTTGAGC GAACGCGCCA AGCGTTGGCT GTTCGGCTGC CGTTCGCCGT GCGAATGGGA GGCTGGTGTA GTGGCAAATA CGATGCAGCG GCTTACCAAC GACGAGATCG AGGGCGCCGC GAAATTCGCC GCAGAGGCGA AGGAGGCCAG GCGGGCGCGT GGTCCAGCCA CCTCCACCTC TGAAGAAGTC GTGAGGAGGG GCGCTGACGG TTCTATTGTC ATTCAGAGCG AAGTTGGTCC ACCTGCTAAA CGGAAAGATT ATGAGCGCAA ATTATTACCC GGAGTCAAAG TTAGGCTTAA AGGTTGGGAG CGAGCGCACA GCCAAGGTGC TGGAACAGGA GCTGAAGCGA AGGCAGGGAT TTTTTATGCT CCCCCGAAGG TAAATCAAGA ACTACAAAAT CGTGGCATAG AGAAATATAT ACGTGAATTA TATGTGAACA AGCCGGCAGA CGTGAAATTA TTTCTTACTA CAGAGACAAA GGCATACCCA GGAACGCTGC GCTTAAAGAC TATTAACTAC CGGGTTGAAG CAGAGCGTGG TGGACAAAGG CGCATACTTT TCGAGGCTTC TTTAGAGGTT GAAAACAAAC GCGCCAATCC GAAGGTTACC GTTGAGACAA CACCCTACGC CTTGTAA
|
Protein sequence | MAESLATKAP KGKSAELALK RPAEHARQPA ASTGPYGEVL ALQRSAGNRA VSQLLESGMS ASPFSGAVLQ RKCASCANSG SECAECRKKR TFALQPKLTI NEPGDRYEQE ADRIAAQVMT MPAQQAASSA PPRIQRLSGE AVAVGRPSVV PDSVHQTLAS PGRPLESTLR QDMGQRFGYD FSRVRVHTSR AAEQSARDVN AHAYTVGQNI VFGAGQFAPR TQEGRRLLAH ELAHVVQQSD AGELHAGKSS KKCSLSLSAD CSVARKPRTG ATGALPSVVQ AMAREEARSV LVAYVTVAGV DDALAAMNAI QETLDMPFTM ENASMRLQLL TAAFSLLDEE DAAIVLKALT KPVGAEQKHL HERFGRLDSD FRRLLLDILR ERAAAKPAPE PEQAEEPAAV APTATWVELH SGVFAYVPNR GKTLKHVAAY VSGHPNVPEA LGQLNDLPLT TPIPEGQSVI IPIEFIDRPK AFQEMPEAVR SRIISMRKAR AQQERYLRFV QVKSGHPLGP GASGLFPVTM ALTEAAIESI VSALKSLIEK VGYAIAFAGG VIHGFLKSIW DAVSGIAKLI YDVLKSIISL ELISDVKKLV SSIKKLSWDK ITDAVGEWAA GWVEKLQSKN PLVAGHAHGY LTGYVMAEAA MFLLTGGQIA ALKGSIWTSK LGQVVKTSRV FKTLETAVAK AKILRAGSPK FNKAVDTLRQ SRLGTAIKAA EVTGAAVVWT ADKVAAVLRL PSSIAGYVVE KAVAHAKQLE PFFERIGELS ERAKRWLFGC RSPCEWEAGV VANTMQRLTN DEIEGAAKFA AEAKEARRAR GPATSTSEEV VRRGADGSIV IQSEVGPPAK RKDYERKLLP GVKVRLKGWE RAHSQGAGTG AEAKAGIFYA PPKVNQELQN RGIEKYIREL YVNKPADVKL FLTTETKAYP GTLRLKTINY RVEAERGGQR RILFEASLEV ENKRANPKVT VETTPYAL
|
| |