Gene Information |
Locus tag | Noc_2092 |
Symbol | |
ID | 3704952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2401223 |
End bp | 2403088 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637738567 |
Product | Integrins alpha chain |
Protein accession | YP_344082 |
Protein GI | 77165557 |
COG category | |
COG ID | |
TIGRFAM ID | |
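The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 2401223
end_bp = 2403088
gene_length_bp = 1866
protein_length_aa = 621

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
computed_length = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final codon
# is a stop codon and does not contribute an amino acid.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)  # 1866 0 621
```

Under these assumptions the computed values agree with the reported Gene Length (1866 bp) and Protein Length (621 aa).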
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0373824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATA AACATGCCTT TACTTCTACG TTCTGTCCTG ATTTCGCGGT GGAACTCGCG GCTGATGTCC CTCGCACCTT TGCGCTGCGG CCTTTGGCGG TGGCGCTTCG CCGGGTGCTG GGCGGGGGGC TGCTGGCGGC GGGACTGATG GGTCCGGCCC TGGGGCAAAG CCCAGGCACG GTCTTGAGCC CAGCTCCGGT CACGAATCAG AGCAAGAGTT TGGATCTGTC CACCCTTAAT GGCGCCAATG GCTTTACGTT CGATGGTTGG GGCGGGTCAG TGAGCAGAGC GGGGGATGTG AACGGGGATG GATTTGACGA TCTGGTGATT AGCGGCTGTT GCGTGGTGTT TGGGACCAGC GGGGGATTTC CTGCGGCGTT GGATCCGTCC ACCCTGGATG GCAGTAATGG CTTTGTATTC AATAGTCGGA CCATTTCAGT CAGTGGCGCG GGGGATGTGA ATGGGGACGG GTTTAATGAC CTGGTGATTG GTGCGCCTGG TATTGGGATC AATGCTCTTA GCAGGGCGGG TCAGAGCTAC GTGGTGTTTG GGGCGGGCGG GGGCTTTCCA GCAGTGTTGG AGCTCTCCAC CCTGGATGGG AGCAACGGTT TTGCGCTCAA TGGTATCGCG GCCTCTAATG GCACGGGCCG GTCGGTGAGC GGAGCGGGTG ATGTGAATGG GGACGGATTT GATGACCTGG TGATTGGTGC GCCTGGTATT GGGATCAATG CTCTTAGCAG GGCGGGTCAG AGCTACGTGG TGTTTGGGGC GGGCGGGGGC TTTCCAGCAG TGTTGGAGCT CTCCACCCTG GATGGGAGCA ACGGTTTTGC GCTCAATGGT ATCGCGGCCT CTAATGGCAC GGGCCGGTCG GTGAGCGGAG CGGGTGATGT GAATGGGGAC GGATTTGATG ACCTGGTGAT TGGCGCGCCT GGTGTCAGCC TCAACGATGT TAGCGGAGTG GGCCAGAGCT ACGTGGTGTT TGGGACGGGC GGGGGCTTTC CAGCAGTGTT GGAGCTCTCC ACCCTGGATG GGAGCAACGG TTTTACGCTC AACGGTATCG TTTTTACACT CAACGGCCTT GGCCTTTACA GCTCTGAGGT TGGTGGCCGT TCAGGCTTTT CGGTGAGCGG AGCGGGGGAT GTGAATGGGG ACGGGTTTGA TGACCTGGTG ATTGGCGCAC CCGATGCTGG CCCCAACGGT GTTAGCGGAG CGGGCCAGAG CTATGTAGTG TTTGGGCGCA GCGGGGGTTT TCCCCCAGTG CTTGATCTGT CCGCCCTGGA TGGGAGTAAC GGTTTTGTGC TCAACGGCAT CGTTTTTACG CTCAACAACG GTCTTGGCCT TTACAGCTTT GAGGTTGGTG GCCACTCGGG CTACTCAGTG AGTGGGGCGG GGGATGTGAA CAGGGACGGG TTTGATGATC TGGTGATTGG CGCGCCCTTT ACCGGCTTCG GCGGCAATTA TTCGGGCCGG AGCTACGTGG TGTTTGGGAC GAATACGGGC TTTCCTGCGG CGCTGGAGCT CTCCGCCCTG GATGGCAGCA AGGGATTTGC GCTCAACGGC AGCGCAGCTG ATGACAGCTC GGGCCGGTCG GTGAGCGGAG CGGGGGATGT GAATGGGGAC GGGTTCGATG ATATTGTGGT GGGCGGGGAA CACCAGAGTT ACGTGGTATT CGGGCGATCT TCGGCCAGCG GCCCGGCGAC CTTGTTCAAT GGGCTGCTTA CCGACGTTGG CACTTTGAGT TTGCCGGCAG GGCTGGAGCG CTGGCTGGCC AGAAGGTGCG GGGATGTATT CAACAGGTCA 
GGACTTTGCA GCGGCTCAAG ATCATTCCAG AGATCGAAGC CACCGCCCCC ACCAGGGGGT TTTTAA
|
Protein sequence | MNDKHAFTST FCPDFAVELA ADVPRTFALR PLAVALRRVL GGGLLAAGLM GPALGQSPGT VLSPAPVTNQ SKSLDLSTLN GANGFTFDGW GGSVSRAGDV NGDGFDDLVI SGCCVVFGTS GGFPAALDPS TLDGSNGFVF NSRTISVSGA GDVNGDGFND LVIGAPGIGI NALSRAGQSY VVFGAGGGFP AVLELSTLDG SNGFALNGIA ASNGTGRSVS GAGDVNGDGF DDLVIGAPGI GINALSRAGQ SYVVFGAGGG FPAVLELSTL DGSNGFALNG IAASNGTGRS VSGAGDVNGD GFDDLVIGAP GVSLNDVSGV GQSYVVFGTG GGFPAVLELS TLDGSNGFTL NGIVFTLNGL GLYSSEVGGR SGFSVSGAGD VNGDGFDDLV IGAPDAGPNG VSGAGQSYVV FGRSGGFPPV LDLSALDGSN GFVLNGIVFT LNNGLGLYSF EVGGHSGYSV SGAGDVNRDG FDDLVIGAPF TGFGGNYSGR SYVVFGTNTG FPAALELSAL DGSKGFALNG SAADDSSGRS VSGAGDVNGD GFDDIVVGGE HQSYVVFGRS SASGPATLFN GLLTDVGTLS LPAGLERWLA RRCGDVFNRS GLCSGSRSFQ RSKPPPPPGG F
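The GC content field reported for this gene could be recomputed from the Gene sequence field with a small helper. This is a minimal sketch, not the database's own method: the function name `gc_percent` is hypothetical, and it assumes that the spaces separating the 10-base blocks should simply be ignored. The example call uses only the first 10-base block of the sequence, so its result is not the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are skipped (assumption: the
    10-base blocks in the record are separated by spaces for display only).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 2 G + 1 C out of 10 bases.
print(gc_percent("ATGAACGATA"))  # 30.0
```

Passing the full Gene sequence value, with its spaces and line break left in place, would give the whole-gene percentage for comparison with the reported figure.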