Gene Noc_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1481 
Symbol 
ID3706011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1640730 
End bp1641920 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content50% 
IMG OID637737968 
Producthypothetical protein 
Protein accessionYP_343497 
Protein GI77164972 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.363757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAA GATACCCTCT TCCCGTTTTT TACACTGTTG CCCTTACGCT TAGTTTAATA 
GTTCCGATGT TGTTTGCCCC CCGCGCTCAG GGAGAAATTC TCGCCATGCT CAATTACGAG
GCAAAACCAG AGCAGCGCAT TCAAAAAGAG GGGCTCGCCA TCATCGATGT TGACCCCAAT
TCACCCAATT TTGGCAAAAT GCTGATGGAT ATCCCCCTGC CCCCTGGTTT GGTGGCCCAT
CATCTTTACT ATAACCAAGA CCATAGCAAA ATCTACATAA CCGCCCTAGA AAAAAGCATC
TTGCATGTCC TGGACATGAC CCAGTTTCCC TATCGGATGA AAATGGTAGA GATTCCCCAA
TGCAAAGTCC TGGAAGACAT GGCCTTCTCC AAAGATAAGC AAACTTGGTA TCTCACCTGT
ATGGGGTCTA GTAACGTTAT CGTAGGCAAT GCCAGCACGG ATAAGCCTAT CAAATCCATC
AAGACTCCAC CTTCAGACTC AGCCTTTATT CGTTATCCCC ATGGAATCGC CCTCCATGAT
GACCTTGACC GCTTGTTAGT CACGAGTACG GTCCGGCACT CGGATTTAGG CGACCCAGGA
GAAACCATTA CGGCGATTGA GGCCAGTTCA GGGAAAGTGC TGTCCACCCA TAAGGTTTCC
ATGAAGCCCT CTCCTTCAGG AGCAGCCCCC GTCGAGGTTC ACTTTCTGCC CGGTGCTGAA
CCTCCGCTGG CCTATATCAC CAATATGTAT GAAGGAAACC TTTGGACAGC GGTTTGGGAT
TCGGATAAAA AAGTATTCGA TTTCCAGCAA GTGGCGGACT TTGCGCCTCA CGGCCGGGGC
GTTCCCCTTG CCCTGGAATT CAATCGCAAG GGTGACCGGT TTTTTGTCAC CACGGCCCAA
CCTGGTCATT TGAATATTTT TGATATCAGC GATCCCCAGG CTCCCGAGCT GCTGAAAGCT
ATTCCCACGG CTCCTGGTGC CCATCATATA GTGTTATCGC CTGATGAGCG TTACGTTTTT
GTCCAGAATA GCTTCCTCAA TCTACCGGAG ATGAGTGATG GCTCCATTAC CGTGGTTGAT
CTCGCAAAAG GCGAAGCGAT AGCGCAAATC GACACCCTCA AAAACCAAGG ATTCAATCCC
AATTGCATTG TGCTTTTACC GGAGTGGGCT TCAGGCGGGC ATTCCCACTA A
 
Protein sequence
MKQRYPLPVF YTVALTLSLI VPMLFAPRAQ GEILAMLNYE AKPEQRIQKE GLAIIDVDPN 
SPNFGKMLMD IPLPPGLVAH HLYYNQDHSK IYITALEKSI LHVLDMTQFP YRMKMVEIPQ
CKVLEDMAFS KDKQTWYLTC MGSSNVIVGN ASTDKPIKSI KTPPSDSAFI RYPHGIALHD
DLDRLLVTST VRHSDLGDPG ETITAIEASS GKVLSTHKVS MKPSPSGAAP VEVHFLPGAE
PPLAYITNMY EGNLWTAVWD SDKKVFDFQQ VADFAPHGRG VPLALEFNRK GDRFFVTTAQ
PGHLNIFDIS DPQAPELLKA IPTAPGAHHI VLSPDERYVF VQNSFLNLPE MSDGSITVVD
LAKGEAIAQI DTLKNQGFNP NCIVLLPEWA SGGHSH