Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1762 |
Symbol | |
ID | 3704779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1981786 |
End bp | 1983708 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637738245 |
Product | Integrins alpha chain |
Protein accession | YP_343764 |
Protein GI | 77165239 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000197793 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTGTA ACAGATCTTC TCGAATTTCC TCCTTTCCCC AACCCCACGC ACTGAGCTTG GCTGTTCGCC AGGCGCTGAC GCCACGCCAT GCCAGCTTTT GGGCGGGAGG CGTACTGATC GCCGGGCTGA GTATGGGCGT TCAGGCCCAG ACTCTGACGT TGTCAGATTT GGACGGACAC AATGGCTTCG TTATCCATGG CGCCAGTCTG AATAGCTCAC CCGGTATTGC GGTGAGCGGA GTGGGAGATG TCAATGGCGA TGGGATCGAT GATCTCATCA TCGGGATTCC TGGCGCTGAT TCCGGCAACG GTTTTTCGGG CGCCAGCTAC GTAGTCTTTG GGAGTAGCGG TGGTTTGGGC CCTAGTCTGG AACTGTCGAG CCTGGATGGA AGTAACGGTT TTGCGATCAA GGGCGTTGGC GCTTTTGACA ATGCCGGCAT TGCGGTGAGC GGAGTGGGAG ATGTCAATGG GGATGGGATC GATGATCTCA TTGTGGGGGC CCCTGGAGTC GATTCTAACG GCAGCGGTTC GGGCGCGGGC TATGTGGTTT TTGGAAGTAG CGGTGGTTTT GGCCCTAGTC TGGAACTGTC GAGCTTGGAT GGGAGTAATG GTTTTGCGAT CAATGGCGCC GGGGCTTTTG AGAACGCCGG TATTTCGGTG AGTGGGGCAG GGGATGTCAA TGGCGATGGC CTGAGTGATC TTATTATGGG CGCCTACGGC GCCAGCCCTA ACGGCAGCGG CTCGGGCGCG GGCTATGTGG TCTTTGGAAG TAGCGGTGGT TTTGGTCCTA GCCTGGAATT GTCGGGTTTG GATGGAAGCA ACGGCTTTGC TATTAATGGT GTTGGCGCCT TTGATAGTGC CGGTATTTCG GTGAGTGGGG CGGGAGATGT CAATGGCGAC GGGATCGATG ATCTCATTGT GGGGGCCCCT GACGCCTATA CTAACAGCGG CACCTCGGGC GCGGGCTATG TGGTGTTTGG AAGCCGCAGG GGTTTTGCTC CTAGCCTGGA GTTATTGAAC CTAAACGGGA ACAACGGCTT TGCTATTAAC GGCGTTGATA TCTTTGACAA CGCCGGCATT TCGGTGAGCG GGATGGGGGA TATCAACGGC GATGGTCTGG GCGATCTGAT TGTCGGCGCC TATGGCGCTG GCCCTAATGG TAGGGCCTCA GGCGCGAGCT ATGTAGTATT TGGAAGCCGC AGCGGTTTTG CTCCTAGCCT GGAGTTGTCG AGTCTGAATG GAAGCAACGG CTTTGCCATT GTTGGCGCTA ATCCCCGCGA CGCATCGGGC ATTTCGGTGA GCGGGGTGGG GGATGTTAGT GGCGACGGCC TTAACGATTT CCTCATTGGC GCTCCGGGCG CCGCGCCTAA CGGCAATTTT TCGGGCGCCA GCTACGTGGT GTTTGGAAAC AGCGTTGGTT TCGGCACCAG CCTGGAACTG GCGGATTTGG ACGGGAACAA TGGCTTTGTG ATTAATGGCG CGAATGCTGG TGAAGCGTCC GGCTTCTCGG TAAGCGGAGC GGGGGATGTG GATGGCGATG GTGCTGATGA TTTCATCATC GGAGCCTACC GTTCGGGTAC GAGCTATGTG GTCTTTGGCA CGAGCGCCAC GGATATTGCC CAATCGATGC TGATGGAAGT CAGCAATATC GTTTCGGACC TGCCAGCGGA AAGTTTCAGC GGGCCGGAAA GCCTGGATAA GATTAACAAT AAGATGTCCA AGGCTGCCGA TGAGAGCCAG CGCGAGGGGG TCCTGTTTTT CGTGGAGAAA CTCATTAGAG GGAACGACGG TTGTGCGCTG CGTGGAGCGC CTGATCCTTT GGGCGATCTT GAGAAGGAAG ATTGGATTAT GAACTGCGAT GACCAGACTC GGGTCTATGA CAAGCTGATC GAGGCCCGGG ATATTCTCAC ACCTTTGTTC TAG
|
Protein sequence | MTCNRSSRIS SFPQPHALSL AVRQALTPRH ASFWAGGVLI AGLSMGVQAQ TLTLSDLDGH NGFVIHGASL NSSPGIAVSG VGDVNGDGID DLIIGIPGAD SGNGFSGASY VVFGSSGGLG PSLELSSLDG SNGFAIKGVG AFDNAGIAVS GVGDVNGDGI DDLIVGAPGV DSNGSGSGAG YVVFGSSGGF GPSLELSSLD GSNGFAINGA GAFENAGISV SGAGDVNGDG LSDLIMGAYG ASPNGSGSGA GYVVFGSSGG FGPSLELSGL DGSNGFAING VGAFDSAGIS VSGAGDVNGD GIDDLIVGAP DAYTNSGTSG AGYVVFGSRR GFAPSLELLN LNGNNGFAIN GVDIFDNAGI SVSGMGDING DGLGDLIVGA YGAGPNGRAS GASYVVFGSR SGFAPSLELS SLNGSNGFAI VGANPRDASG ISVSGVGDVS GDGLNDFLIG APGAAPNGNF SGASYVVFGN SVGFGTSLEL ADLDGNNGFV INGANAGEAS GFSVSGAGDV DGDGADDFII GAYRSGTSYV VFGTSATDIA QSMLMEVSNI VSDLPAESFS GPESLDKINN KMSKAADESQ REGVLFFVEK LIRGNDGCAL RGAPDPLGDL EKEDWIMNCD DQTRVYDKLI EARDILTPLF
|
| |