Gene Noc_2094 details

Gene Information

Locus tag: Noc_2094
Symbol:
ID: 3704954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2405280
End bp: 2407400
Gene Length: 2121 bp
Protein Length: 706 aa
Translation table: 11
GC content: 57%
IMG OID: 637738569
Product: Integrins alpha chain
Protein accession: YP_344084
Protein GI: 77165559
COG category:
COG ID:
TIGRFAM ID:
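
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes one stop codon not counted in the protein length; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2405280
end_bp = 2407400
gene_length_bp = 2121
protein_length_aa = 706

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # prints True
print(gene_length_bp % 3 == 0)                   # prints True
print(expected_protein_aa == protein_length_aa)  # prints True
```

Under these assumptions the 2121 bp span corresponds to 707 codons, that is, 706 amino acids plus a stop codon.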


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0736075
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAACC AAGATAACTC CATTTCTTTT TCCGTTTCGT GTCACCCTGA TTCTGCGGTG 
GAATTCTCCG CCGTTATGCC TTGCGCTCCT GCGCCGCGGC CTTTGGCGGT GGCCCTTCGC
CGGGTGCTGG GCGGAGGCTT GCTGGCTGTA GGGCTGGCGG GTCCGGCCCT GAGCCAGAAT
CAGGACCCGG TCTTGAGCGC GGTTCCGGTC CCGAATCAGA GCAAGAGCCT GGAACTGTCC
ACTCTGGATG GCAGCAATGG CTTTGTATTC AATAGTCCGG CCACTTCCGT AAGCGGTGCT
GGGGATGTGA ATGGCGACGG GTTTGATGAT CTGGTGTTTG GCAACCCCTA TGCCTCTCCC
AATGGCCTTG ATAGTGCGGG CCAGAGCTAT GTAGTGTTTG GGACGGGTGG AGATTTTCCC
GCCGCGCTAA GTTCCTCCGA CCTTAATGGC GATAACGGCT TTACCCTCAA TGGTATTGCG
ACCTATAGCT ACTCGGGCCT CCCTGATAAA CTGGGCTCTT CGGTGAGCAG TGCGGGGGAT
ATGAACGGCG ATGGGTTTGA TGACATCCTT GTCGGCGCGT CCGGTGTCCA TACCTTCAAA
AATGGCGATA TAACGGGCCA GAGCTACGTG GTGTTTGGGA CCAGCGGGGG CTTTCCCCCG
GCGCTGGAGC GCTCAGATCT TGGTGGCAGC AATGGTTTTG TGATCCGCAA CATCTTGTCC
GGTGATTACT CGGGCTTTTC TGTGAGCGGT GCGGGGGATA TAGACGGCGA TGGATTTGAT
GATGTCCTTA TCGGCGCCAA AAATCTCGGG AGATCGGGAG ACTATAGTGT GGGCTATGCG
GATGAAACCT ACGTGATATT CGGAGACAGT GACGGAACCT CAGGTAATAA AATCCTTGCT
GATACCACTA ATGCCTACAG CGAAAGCTTT ACTACCTCGG TAAGTGGTGC GGGGGATGTG
AATGGGGACG GACTTAATGA CATGTTGCTC AGCACGTCTG GTTCTCCCTC CGGCGGTGGT
AGCGACAGCG ACGTTTCTGC GGTGAGCAAG ATCTACGTGG TCTTTGGGAT GAGCGGGCAA
TTTTCCGATT TTTTCAATCT CTCCAATCTC GATGGCGACA ATGGTTTTAT CATCACCAAT
AGTACCCAGA CCGATAATTC TTTACGTTAC ATGGTGAGCG GTGCGGGGGA TGTGAATGGC
GACGGGTTTG ATGATCTGGT GTTTGGCAAC CCTTATGCCT CTCCCAATGG CCTTGATGGT
GCGGGCCAGA GCTACGTGGT GTTTGGGACG GACGGGGGCT TTCCTGCGGC GCTGGATCTC
TCCACCCTGG ATGGCAGCAA TGGCTTCGTG CTCAACGGTA TCGAGGCCGG TGACCATTCG
GGCCGTTCGG TGAGCGGTGC GGGGGACGTC AATGGTGACG GGTTTGATGA TTTGGTGATT
GGCGCGCCTG GTGCCGGCTT GGAGAAGCTT GTACCAAAAA TGAATAAAGC AGACCTCATC
GGCGCGTTCA CCGCCAGCCC CAACGGTCTT GACAGTTCGG GCCAGGGCTA TGTGGTATTT
GGGATGGACG GGGGCTTTCC CGCGGCGTTG GAACTATCCG AACTTGATGG CAGCAACGGC
TTTATCATCA ACGGCATCGG GCCCGGTGGC CGTTTGGGTC AGTCGGTGAG CGGTGCGGGG
GATGTCAATG GCGATGGACT CGCTGACATT GTAATCGGCG CCGGGAGCAA GAGCTACGTG
GTGTTCGGGA CGGCTTCGGG GGGTCCCGCG GTTTTGCTCA AGGGGCTGAT TGCTGAGGTT
GGGGCATTGG ATCTACCGGC GGGGCTTGAA CACTGGCTAA CCAGGCCGCT TAAAAGGGCC
GAGAGGAAGC TGGCCCAGGG CGAGGTAGCC AAAGCACTTT ACAAGGTAGT GGGGTTTATC
CAGCGGGCGA GAGTGTTGCG GAAATATGGG ATACTGCCGG CGGCCGAGGC CAACGCCCTC
ATTGCCCAGG CCAAGGCTAT TATCAAGGCG CTATTGGATT TGCCGCAGCT CTCGGGCGTT
GCCGCCTCGG ATCTTCTCCC CGCTGACTTG GTACCCATTG ATGAGCCGGT ACCGAGAAGT
CCCTCCGCCC CTACTCGATA A
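
A GC content figure such as the one reported for this gene is typically the fraction of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, not the database's own method; the function name `gc_percent` is hypothetical, and the excerpt is only the first line of the gene sequence above, so its value will differ from the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases), used as a short excerpt.
excerpt = "ATGAACAACC AAGATAACTC CATTTCTTTT TCCGTTTCGT GTCACCCTGA TTCTGCGGTG"
print(f"{gc_percent(excerpt):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence, with or without the spacing used above, gives the whole-gene value.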
 
Protein sequence
MNNQDNSISF SVSCHPDSAV EFSAVMPCAP APRPLAVALR RVLGGGLLAV GLAGPALSQN 
QDPVLSAVPV PNQSKSLELS TLDGSNGFVF NSPATSVSGA GDVNGDGFDD LVFGNPYASP
NGLDSAGQSY VVFGTGGDFP AALSSSDLNG DNGFTLNGIA TYSYSGLPDK LGSSVSSAGD
MNGDGFDDIL VGASGVHTFK NGDITGQSYV VFGTSGGFPP ALERSDLGGS NGFVIRNILS
GDYSGFSVSG AGDIDGDGFD DVLIGAKNLG RSGDYSVGYA DETYVIFGDS DGTSGNKILA
DTTNAYSESF TTSVSGAGDV NGDGLNDMLL STSGSPSGGG SDSDVSAVSK IYVVFGMSGQ
FSDFFNLSNL DGDNGFIITN STQTDNSLRY MVSGAGDVNG DGFDDLVFGN PYASPNGLDG
AGQSYVVFGT DGGFPAALDL STLDGSNGFV LNGIEAGDHS GRSVSGAGDV NGDGFDDLVI
GAPGAGLEKL VPKMNKADLI GAFTASPNGL DSSGQGYVVF GMDGGFPAAL ELSELDGSNG
FIINGIGPGG RLGQSVSGAG DVNGDGLADI VIGAGSKSYV VFGTASGGPA VLLKGLIAEV
GALDLPAGLE HWLTRPLKRA ERKLAQGEVA KALYKVVGFI QRARVLRKYG ILPAAEANAL
IAQAKAIIKA LLDLPQLSGV AASDLLPADL VPIDEPVPRS PSAPTR
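
The protein sequence can be reproduced from the gene sequence using the record's translation table (11). The sketch below builds the codon table from the NCBI amino-acid string for table 11 and translates up to the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the code assumes an unambiguous A/C/G/T sequence in frame; it does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 21 bases of the gene sequence above.
head = "ATGAACAACC AAGATAACTC CATTTCTTTT TCCGTTTCGT GTCACCCTGA TTCTGCGGTG"
tail = "CCCTCCGCCC CTACTCGATA A"
print(translate(head))  # prints MNNQDNSISFSVSCHPDSAV
print(translate(tail))  # prints PSAPTR
```

The two outputs match the first 20 and last 6 residues of the protein sequence listed above.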