Gene Noc_2092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2092 
Symbol 
ID3704952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2401223 
End bp2403088 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content60% 
IMG OID637738567 
ProductIntegrins alpha chain 
Protein accessionYP_344082 
Protein GI77165557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0373824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATA AACATGCCTT TACTTCTACG TTCTGTCCTG ATTTCGCGGT GGAACTCGCG 
GCTGATGTCC CTCGCACCTT TGCGCTGCGG CCTTTGGCGG TGGCGCTTCG CCGGGTGCTG
GGCGGGGGGC TGCTGGCGGC GGGACTGATG GGTCCGGCCC TGGGGCAAAG CCCAGGCACG
GTCTTGAGCC CAGCTCCGGT CACGAATCAG AGCAAGAGTT TGGATCTGTC CACCCTTAAT
GGCGCCAATG GCTTTACGTT CGATGGTTGG GGCGGGTCAG TGAGCAGAGC GGGGGATGTG
AACGGGGATG GATTTGACGA TCTGGTGATT AGCGGCTGTT GCGTGGTGTT TGGGACCAGC
GGGGGATTTC CTGCGGCGTT GGATCCGTCC ACCCTGGATG GCAGTAATGG CTTTGTATTC
AATAGTCGGA CCATTTCAGT CAGTGGCGCG GGGGATGTGA ATGGGGACGG GTTTAATGAC
CTGGTGATTG GTGCGCCTGG TATTGGGATC AATGCTCTTA GCAGGGCGGG TCAGAGCTAC
GTGGTGTTTG GGGCGGGCGG GGGCTTTCCA GCAGTGTTGG AGCTCTCCAC CCTGGATGGG
AGCAACGGTT TTGCGCTCAA TGGTATCGCG GCCTCTAATG GCACGGGCCG GTCGGTGAGC
GGAGCGGGTG ATGTGAATGG GGACGGATTT GATGACCTGG TGATTGGTGC GCCTGGTATT
GGGATCAATG CTCTTAGCAG GGCGGGTCAG AGCTACGTGG TGTTTGGGGC GGGCGGGGGC
TTTCCAGCAG TGTTGGAGCT CTCCACCCTG GATGGGAGCA ACGGTTTTGC GCTCAATGGT
ATCGCGGCCT CTAATGGCAC GGGCCGGTCG GTGAGCGGAG CGGGTGATGT GAATGGGGAC
GGATTTGATG ACCTGGTGAT TGGCGCGCCT GGTGTCAGCC TCAACGATGT TAGCGGAGTG
GGCCAGAGCT ACGTGGTGTT TGGGACGGGC GGGGGCTTTC CAGCAGTGTT GGAGCTCTCC
ACCCTGGATG GGAGCAACGG TTTTACGCTC AACGGTATCG TTTTTACACT CAACGGCCTT
GGCCTTTACA GCTCTGAGGT TGGTGGCCGT TCAGGCTTTT CGGTGAGCGG AGCGGGGGAT
GTGAATGGGG ACGGGTTTGA TGACCTGGTG ATTGGCGCAC CCGATGCTGG CCCCAACGGT
GTTAGCGGAG CGGGCCAGAG CTATGTAGTG TTTGGGCGCA GCGGGGGTTT TCCCCCAGTG
CTTGATCTGT CCGCCCTGGA TGGGAGTAAC GGTTTTGTGC TCAACGGCAT CGTTTTTACG
CTCAACAACG GTCTTGGCCT TTACAGCTTT GAGGTTGGTG GCCACTCGGG CTACTCAGTG
AGTGGGGCGG GGGATGTGAA CAGGGACGGG TTTGATGATC TGGTGATTGG CGCGCCCTTT
ACCGGCTTCG GCGGCAATTA TTCGGGCCGG AGCTACGTGG TGTTTGGGAC GAATACGGGC
TTTCCTGCGG CGCTGGAGCT CTCCGCCCTG GATGGCAGCA AGGGATTTGC GCTCAACGGC
AGCGCAGCTG ATGACAGCTC GGGCCGGTCG GTGAGCGGAG CGGGGGATGT GAATGGGGAC
GGGTTCGATG ATATTGTGGT GGGCGGGGAA CACCAGAGTT ACGTGGTATT CGGGCGATCT
TCGGCCAGCG GCCCGGCGAC CTTGTTCAAT GGGCTGCTTA CCGACGTTGG CACTTTGAGT
TTGCCGGCAG GGCTGGAGCG CTGGCTGGCC AGAAGGTGCG GGGATGTATT CAACAGGTCA
GGACTTTGCA GCGGCTCAAG ATCATTCCAG AGATCGAAGC CACCGCCCCC ACCAGGGGGT
TTTTAA
 
Protein sequence
MNDKHAFTST FCPDFAVELA ADVPRTFALR PLAVALRRVL GGGLLAAGLM GPALGQSPGT 
VLSPAPVTNQ SKSLDLSTLN GANGFTFDGW GGSVSRAGDV NGDGFDDLVI SGCCVVFGTS
GGFPAALDPS TLDGSNGFVF NSRTISVSGA GDVNGDGFND LVIGAPGIGI NALSRAGQSY
VVFGAGGGFP AVLELSTLDG SNGFALNGIA ASNGTGRSVS GAGDVNGDGF DDLVIGAPGI
GINALSRAGQ SYVVFGAGGG FPAVLELSTL DGSNGFALNG IAASNGTGRS VSGAGDVNGD
GFDDLVIGAP GVSLNDVSGV GQSYVVFGTG GGFPAVLELS TLDGSNGFTL NGIVFTLNGL
GLYSSEVGGR SGFSVSGAGD VNGDGFDDLV IGAPDAGPNG VSGAGQSYVV FGRSGGFPPV
LDLSALDGSN GFVLNGIVFT LNNGLGLYSF EVGGHSGYSV SGAGDVNRDG FDDLVIGAPF
TGFGGNYSGR SYVVFGTNTG FPAALELSAL DGSKGFALNG SAADDSSGRS VSGAGDVNGD
GFDDIVVGGE HQSYVVFGRS SASGPATLFN GLLTDVGTLS LPAGLERWLA RRCGDVFNRS
GLCSGSRSFQ RSKPPPPPGG F