Gene Noc_1762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1762 
Symbol 
ID3704779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1981786 
End bp1983708 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content56% 
IMG OID637738245 
ProductIntegrins alpha chain 
Protein accessionYP_343764 
Protein GI77165239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000197793 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTGTA ACAGATCTTC TCGAATTTCC TCCTTTCCCC AACCCCACGC ACTGAGCTTG 
GCTGTTCGCC AGGCGCTGAC GCCACGCCAT GCCAGCTTTT GGGCGGGAGG CGTACTGATC
GCCGGGCTGA GTATGGGCGT TCAGGCCCAG ACTCTGACGT TGTCAGATTT GGACGGACAC
AATGGCTTCG TTATCCATGG CGCCAGTCTG AATAGCTCAC CCGGTATTGC GGTGAGCGGA
GTGGGAGATG TCAATGGCGA TGGGATCGAT GATCTCATCA TCGGGATTCC TGGCGCTGAT
TCCGGCAACG GTTTTTCGGG CGCCAGCTAC GTAGTCTTTG GGAGTAGCGG TGGTTTGGGC
CCTAGTCTGG AACTGTCGAG CCTGGATGGA AGTAACGGTT TTGCGATCAA GGGCGTTGGC
GCTTTTGACA ATGCCGGCAT TGCGGTGAGC GGAGTGGGAG ATGTCAATGG GGATGGGATC
GATGATCTCA TTGTGGGGGC CCCTGGAGTC GATTCTAACG GCAGCGGTTC GGGCGCGGGC
TATGTGGTTT TTGGAAGTAG CGGTGGTTTT GGCCCTAGTC TGGAACTGTC GAGCTTGGAT
GGGAGTAATG GTTTTGCGAT CAATGGCGCC GGGGCTTTTG AGAACGCCGG TATTTCGGTG
AGTGGGGCAG GGGATGTCAA TGGCGATGGC CTGAGTGATC TTATTATGGG CGCCTACGGC
GCCAGCCCTA ACGGCAGCGG CTCGGGCGCG GGCTATGTGG TCTTTGGAAG TAGCGGTGGT
TTTGGTCCTA GCCTGGAATT GTCGGGTTTG GATGGAAGCA ACGGCTTTGC TATTAATGGT
GTTGGCGCCT TTGATAGTGC CGGTATTTCG GTGAGTGGGG CGGGAGATGT CAATGGCGAC
GGGATCGATG ATCTCATTGT GGGGGCCCCT GACGCCTATA CTAACAGCGG CACCTCGGGC
GCGGGCTATG TGGTGTTTGG AAGCCGCAGG GGTTTTGCTC CTAGCCTGGA GTTATTGAAC
CTAAACGGGA ACAACGGCTT TGCTATTAAC GGCGTTGATA TCTTTGACAA CGCCGGCATT
TCGGTGAGCG GGATGGGGGA TATCAACGGC GATGGTCTGG GCGATCTGAT TGTCGGCGCC
TATGGCGCTG GCCCTAATGG TAGGGCCTCA GGCGCGAGCT ATGTAGTATT TGGAAGCCGC
AGCGGTTTTG CTCCTAGCCT GGAGTTGTCG AGTCTGAATG GAAGCAACGG CTTTGCCATT
GTTGGCGCTA ATCCCCGCGA CGCATCGGGC ATTTCGGTGA GCGGGGTGGG GGATGTTAGT
GGCGACGGCC TTAACGATTT CCTCATTGGC GCTCCGGGCG CCGCGCCTAA CGGCAATTTT
TCGGGCGCCA GCTACGTGGT GTTTGGAAAC AGCGTTGGTT TCGGCACCAG CCTGGAACTG
GCGGATTTGG ACGGGAACAA TGGCTTTGTG ATTAATGGCG CGAATGCTGG TGAAGCGTCC
GGCTTCTCGG TAAGCGGAGC GGGGGATGTG GATGGCGATG GTGCTGATGA TTTCATCATC
GGAGCCTACC GTTCGGGTAC GAGCTATGTG GTCTTTGGCA CGAGCGCCAC GGATATTGCC
CAATCGATGC TGATGGAAGT CAGCAATATC GTTTCGGACC TGCCAGCGGA AAGTTTCAGC
GGGCCGGAAA GCCTGGATAA GATTAACAAT AAGATGTCCA AGGCTGCCGA TGAGAGCCAG
CGCGAGGGGG TCCTGTTTTT CGTGGAGAAA CTCATTAGAG GGAACGACGG TTGTGCGCTG
CGTGGAGCGC CTGATCCTTT GGGCGATCTT GAGAAGGAAG ATTGGATTAT GAACTGCGAT
GACCAGACTC GGGTCTATGA CAAGCTGATC GAGGCCCGGG ATATTCTCAC ACCTTTGTTC
TAG
 
Protein sequence
MTCNRSSRIS SFPQPHALSL AVRQALTPRH ASFWAGGVLI AGLSMGVQAQ TLTLSDLDGH 
NGFVIHGASL NSSPGIAVSG VGDVNGDGID DLIIGIPGAD SGNGFSGASY VVFGSSGGLG
PSLELSSLDG SNGFAIKGVG AFDNAGIAVS GVGDVNGDGI DDLIVGAPGV DSNGSGSGAG
YVVFGSSGGF GPSLELSSLD GSNGFAINGA GAFENAGISV SGAGDVNGDG LSDLIMGAYG
ASPNGSGSGA GYVVFGSSGG FGPSLELSGL DGSNGFAING VGAFDSAGIS VSGAGDVNGD
GIDDLIVGAP DAYTNSGTSG AGYVVFGSRR GFAPSLELLN LNGNNGFAIN GVDIFDNAGI
SVSGMGDING DGLGDLIVGA YGAGPNGRAS GASYVVFGSR SGFAPSLELS SLNGSNGFAI
VGANPRDASG ISVSGVGDVS GDGLNDFLIG APGAAPNGNF SGASYVVFGN SVGFGTSLEL
ADLDGNNGFV INGANAGEAS GFSVSGAGDV DGDGADDFII GAYRSGTSYV VFGTSATDIA
QSMLMEVSNI VSDLPAESFS GPESLDKINN KMSKAADESQ REGVLFFVEK LIRGNDGCAL
RGAPDPLGDL EKEDWIMNCD DQTRVYDKLI EARDILTPLF