Gene Noc_0672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0672 
Symbol 
ID3706920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp728217 
End bp729545 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content55% 
IMG OID637737180 
Producttransposase IS4 
Protein accessionYP_342721 
Protein GI77164196 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCTC AAACGCCACC CGATTTACCC ACCGATGATC TGTTTCGTCA CCGCCTGGAA 
AACCTCATTG ATACGCGCCA TGAGCTGGCC AAACTCGCAG CGTTAATTGA TTGGGAGTTC
TTTGATGCAC AGTGGGGTGA GGCATTTTGT GAGAACGGTC GTCCTGCCAT CGCGACCCGA
TTGATTGCGG GTCTGCACTA CTTGAAACAC ACCTACGGCT TGTCCGATGA ACAGGTGGTG
CAACGCTGGG CAGAGAATCC GTACTGGCAA TACTTTTGTG GTGAAAGGTA TTTTCAACAC
GAGCTGCCAT TGAACCCGAG TTCGTTGACA CGTTGGCGTC AGCGTTTAGG TGACGAGGGT
ATGGAATCGT TACTGAGTGC GACGATCGAT GCAGCGATTG CGTCGAAGGC GGTGAAAGCA
CGAGATTTAA AGTGCGTTAC GGTTGACACG ACGGTGCAGG AGAAAGCGAT TGCGTTCCCC
ACCGACTCGA AGCTGTACAA CCGGGCCCGT GAGCGCCTGG TACGCTTGGC GAAAGCACAC
GGCGTACCGT TGCGCCAAAG CTACGTGCGC GTGGGCCCGC GACTTTTGTT CAAGAACAAC
CGCTACGGTT ATGCGCGACA GACACGCCGC ATGCGCCGCA CCGCCGCGAA GCTGAAGACC
GTGCTGGGGC GGGTGGTTCG GGACATTGAG CGCAAATTGC CGAAGCAATC GGCCTCGGTG
CAAGCGGCCT TTGCAGAGTC GATGGCATTA ACCAAACGAT TGCTCGATCA ACAACGTCAC
GATAAAAATA AACTCTACGC GCTGCACGCA CCGGAGGTTG AGTGCATTGC CAAAGGCACG
GCGCACAAGC GCTACGAGTT TGGTGTCAAG GTTAGTATCG CCACCACCAA CCGTTCCAAC
CTGGTGGTGG GCGCACAGTC ACTGCCAGGC AGTCCCTACG ATGGTCATAC GCTGAAGAAA
GCCTTGCACC AGGTTGAGCG ATTAACCGGA CAACGGCCCG AGCGTTGTTA TGTGGATCTA
GGTTATCGCG GTCACGATGT TGACGACGTC GACGTATTCA AAGCTCGACA AAAGCGTGGC
GTCACGCGCA CTATCCGACG TGAGTTAAAA CGACGTAACG CTATCGAGCC GATCATTGGT
CACATGAAAA ACGACGGCCT GTTGCATCGC AATTATCTCA AAGGGGTCGA GGGCGATGCA
ATCAACGCCA TCCTGTGCGG TGCCGGCCAG AATCTCCGGC TGATCCTCAG GTACCTGAGG
ATTTTTTGGC TCAAAATCCA ACCGGCTTTT ATACAATACC TGTTACTTGC TCCACCTCGT
GCAGCCTGA
 
Protein sequence
MKPQTPPDLP TDDLFRHRLE NLIDTRHELA KLAALIDWEF FDAQWGEAFC ENGRPAIATR 
LIAGLHYLKH TYGLSDEQVV QRWAENPYWQ YFCGERYFQH ELPLNPSSLT RWRQRLGDEG
MESLLSATID AAIASKAVKA RDLKCVTVDT TVQEKAIAFP TDSKLYNRAR ERLVRLAKAH
GVPLRQSYVR VGPRLLFKNN RYGYARQTRR MRRTAAKLKT VLGRVVRDIE RKLPKQSASV
QAAFAESMAL TKRLLDQQRH DKNKLYALHA PEVECIAKGT AHKRYEFGVK VSIATTNRSN
LVVGAQSLPG SPYDGHTLKK ALHQVERLTG QRPERCYVDL GYRGHDVDDV DVFKARQKRG
VTRTIRRELK RRNAIEPIIG HMKNDGLLHR NYLKGVEGDA INAILCGAGQ NLRLILRYLR
IFWLKIQPAF IQYLLLAPPR AA