Gene Noc_1458 details

Gene Information

Locus tag: Noc_1458
Symbol:
ID: 3706027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1614546
End bp: 1615649
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 55%
IMG OID: 637737947
Product: WD-40-like beta propeller repeat-containing protein
Protein accession: YP_343476
Protein GI: 77164951
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0823] Periplasmic component of the Tol biopolymer transport system
TIGRFAM ID:
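The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, not stated in this record) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1614546
end_bp = 1615649
gene_length_bp = 1104
protein_length_aa = 367

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)

coords_match_length = span_bp == gene_length_bp
protein_matches_cds = remainder == 0 and codons - 1 == protein_length_aa

print(span_bp, codons, coords_match_length, protein_matches_cds)
# prints: 1104 368 True True
```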


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0940503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCTTG GAGGTAAATT CATGCGAGAC ATCAATACTG TCGCTTCCAG TTGGCTCCTA 
GTGGCGACGC TCTGCGCGGG CGCTGCTGGG GCTATCGCGG CAGAACCCAC TAACAGTCCC
GCTTCCCCGA CGGAGGAGCT TCGCTATCCC GGCGAAAAGC ACCTGACCCA TCTCCGTCAA
CTTACCTTTG GCGGCCAGAA CGCCGAAGCT TATTTTTCTT TTGACGGCAA GGAGCTGATT
TTTCAGTCTA CCCGTGACGA TTTGAAGTGC GACGCTATCT TCCGTATGGA TGCGGATGGC
AAGAATGTCC GCCAGGTCTC CTCTGGCAAA GGGGTGACCA CCTGCTCATT TATTGCCCCG
GACGACCAGG CCATTATCTA TGCCTCTACT CATCTGGCGG GCGACCAGTG TCCCCCCAAA
CCTGACTACT CCCAAGGCTA CGTCTGGCCC TTGTACGAGG GCTATGACAT TTTCCGGGCT
GATCCGGAAG GCGGCAATCG GGTGCGCCTG ACCGACACCC CTGGCTACGA TGCCGAGGGA
GTTTACTCTC CCAAGGGGGA CAAAATTATC TTTACGTCAG CCCGCACGGG GGATCTGGAA
CTGTTCATGA TGAACCCGGA CGGCAGCGAG GTGGAACAGC TCACTGATCA ACCCGGCTAC
GATGGCGGGG CCTTTTTCTC CCGGGATGGT CAATGGATCG TCTGGCGCGC CTCCCGTCCC
GAAGGTAAAG CCCTGGCCGA TTATCAAAGC TTGCTCAAAC AAGGACTCAT CCGTCCTAGC
CAACTGGAAA TTTTCATCAT GAACCTAGAG GAACGCAAAC CTATCCAGCT CACCGACAAT
GGCGCCGCCA ACTTCGGCCC CTATTGGCAT CCGGATGGCA AGCACATCAT CTTCTCCTCC
AACATGCACA ATCCCAAAGG CCGCAACTTT GATCTGTTCC TGATTAACGT GGATACCCGG
GAAATCGAGC AGATCACTCA TCATCCTGAT TTCGATGGCT TTCCCATGTT CTCCCACGAT
GGCAAAAAGC TGGTATTTGC CTCCAACCGC AACGGCAAGG TGGAAGGGGA AACCAATGTC
TTCATGGTAG ATTGGCGCTG GTAA
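The GC content field in the Gene Information block can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as plain text with the spaces and line breaks shown above. `gc_percent` is a hypothetical helper name, and only the first sequence line is used here for brevity.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied with its 10-base spacing.
first_line = "ATGACTCTTG GAGGTAAATT CATGCGAGAC ATCAATACTG TCGCTTCCAG TTGGCTCCTA"
n_bases = len("".join(first_line.split()))

print(f"{gc_percent(first_line):.1f}% GC over {n_bases} bases")
```

Applying the same function to the full 1104 bp sequence gives the figure to compare with the reported GC content.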
 
Protein sequence
MTLGGKFMRD INTVASSWLL VATLCAGAAG AIAAEPTNSP ASPTEELRYP GEKHLTHLRQ 
LTFGGQNAEA YFSFDGKELI FQSTRDDLKC DAIFRMDADG KNVRQVSSGK GVTTCSFIAP
DDQAIIYAST HLAGDQCPPK PDYSQGYVWP LYEGYDIFRA DPEGGNRVRL TDTPGYDAEG
VYSPKGDKII FTSARTGDLE LFMMNPDGSE VEQLTDQPGY DGGAFFSRDG QWIVWRASRP
EGKALADYQS LLKQGLIRPS QLEIFIMNLE ERKPIQLTDN GAANFGPYWH PDGKHIIFSS
NMHNPKGRNF DLFLINVDTR EIEQITHHPD FDGFPMFSHD GKKLVFASNR NGKVEGETNV
FMVDWRW
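The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch follows, assuming the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (the two tables differ in the permitted start codons). `CODON_TABLE` and `translate` are illustrative names, and only the first and last gene sequence lines are translated.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence; both start on a codon boundary
# because every full line holds 60 bases (20 codons).
first_line = "ATGACTCTTG GAGGTAAATT CATGCGAGAC ATCAATACTG TCGCTTCCAG TTGGCTCCTA"
last_line = "TTCATGGTAG ATTGGCGCTG GTAA"

print(translate(first_line))  # MTLGGKFMRDINTVASSWLL
print(translate(last_line))   # FMVDWRW*
```

The two outputs match the first 20 and last 7 residues of the protein sequence above, with the trailing `*` standing for the TAA stop codon.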