Gene Noc_2895 details


Gene Information

Locus tag: Noc_2895
Symbol:
ID: 3707449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3272426
End bp: 3274168
Gene Length: 1743 bp
Protein Length: 580 aa
Translation table: 11
GC content: 54%
IMG OID: 637739371
Product: type II secretion system protein E
Protein accession: YP_344871
Protein GI: 77166346
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
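
The length fields above are consistent with each other if the start and end coordinates are read as 1-based and inclusive. That reading is an assumption, but it reproduces the listed 1743 bp. The minimal sketch below shows the arithmetic; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon is a
    stop codon, which is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the terminal stop codon
    return gene_length, protein_length


# Coordinates copied from the record above.
print(cds_lengths(3272426, 3274168))  # (1743, 580)
```

The result matches the Gene Length (1743 bp) and Protein Length (580 aa) fields: 1743 / 3 = 581 codons, one of which is the stop codon.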


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.847539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGCAAC AACTTGGCAA AACTATTCCT ATTACGCCTG CTCCTGAAGA GGAGGCGGTT 
CCCCAAAGCC TGGAGGAAAA GCTTGGCGAG TATTTGATTG AGCGGCGCAA GCTGCGGGCA
GGTGAATTCC GCCATGCCTG GCGTTTGGCG GAGGAGGAAG GGCTGCGGCT TTCCGTTATG
CTCATAAGGC TGGGGTTGAT CTCGGAGCGG GATATGGCGG AAGCCTTTTC CGCCTTGCTC
GATCTGCCGC TGCTAGCAAC GAACGATTAT GATAATAGCG ATCTGAACGG TCAAGTTTCA
TTGCGGTTCC TCAAAGAATT CCGGGCCATT CCCGTGCTGG AGGAAGAGGA GGCAGTGGTG
CTGGCGATGG CCGACCCTAG TGATGATTAT GTCCAGGAAG CCATTGCCCT GGCTTACAGC
AAGCCCATCA GTCTGCGGGT CGGTTTGCCT TCGGATATTG AAGCGGCCCT TGATCGTCAG
GCCGGGGGAA AATCGGCCAT GGGACAGATC GTCGAACATC TGGGGGCGGA CGAAGATGGG
GAAACCGATG TCCAGCACCT GAAGGATATG GCCAGCGAGG CCCCGGTCGT CCGCCTGGTG
AATTTGTTGA TTCAGCGTGC GGTGGAATCC CGAGCTTCGG ATATCCATAT CGAGCCTTTT
GAAAACAGTC TTCGGGTCCG CTACCGGATC GATGGCGTGT TGCGGGATGT AGAGGCCCCC
CCGGCGCGTT CCACCGCTGC TGTGATTTCC CGCTTCAAGA TTATGGCTAA GCTTAATATC
GCTGAGCGGC GATTACCCCA GGATGGGCGA ATCCAGCTTC GTATCCAAGG GCGGGAGCTG
GACTTGCGGG TCTCGACAGT GCCGACTCTG TATGGCGAGA GCGTGGTGCT TCGGCTGCTA
GATAAAGGCA ATGTGGTGCT GGATTTCGAC TCCCTGGGCT TCCAGGGAAG TACCTTGGAG
CGTTTTTTGC ATGTATTGGA ACAGCCCCAT GGAATTATTG TCGTCACGGG TCCCACAGGT
AGTGGTAAAA GCACCACCCT TTATACCGCG TTGCATAAAC TCAATACTCC AGACCGAAAA
ATTGTAACGG TGGAAGATCC GGTGGAGTAT CAGCTTGAAG GGATCAACCA AATCCAGGTT
AAACCGCAGA TTGGCCTTAC TTTTGCGGGT GCTTTGCGGT CTATCGTGCG CCAAGATCCG
GATGTCATCA TGATTGGTGA AATGCGGGAT AAGGAGACTG CTGGCATCGC GGTTCAATCG
GCCCTGACGG GTCATGTGGT GCTCTCCACT CTCCATACCA ATGATGCGGC TGGAGGGGTT
ACCCGCTTAT TGGATATGGG AATAGAGGAT TATCTGCTGA CCTCCACTGT CAATGGTATT
TTGGCCCAAC GCTTGGTACG CACTCTCTGT AAACATTGCA GTGAACCTTA CGAGGCCCTG
CCTGAACTTA TAGAGGAGCT GGAATTGCAT CGCTTCAGCC AGGAGAAACC CGTGATGTTG
CATCGGGCCG TTGGCTGCGA GCAATGTAAC GGGATTGGTT ATCATGGGCG GGCGGCGATC
ATGGAATTTC TGGTGATGAG CGATCCTATC CGCCGTCTTG TGCTCCAGCA TACCGATGCG
GGGGAGATTC AAAAGCAAGC TCAGAAAGAA GGCATGCGTA TTATGTATGA AGATGGTCTT
CATAAAGCAC TCTCTGGCTT CACTACTATT GAAGAGGTGA TCCGAGTCAG CCAGGAATCG
TGA
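
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below strips the whitespace and computes a GC percentage that can be compared with the GC content field in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported value is not stated in the record.

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Return the percentage of G and C bases in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not the full gene.
print(gc_percent("GGCC AATT"))  # 50.0
```

Pasting the full gene sequence into `gc_percent` gives a value to check against the listed 54%.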
 
Protein sequence
MAQQLGKTIP ITPAPEEEAV PQSLEEKLGE YLIERRKLRA GEFRHAWRLA EEEGLRLSVM 
LIRLGLISER DMAEAFSALL DLPLLATNDY DNSDLNGQVS LRFLKEFRAI PVLEEEEAVV
LAMADPSDDY VQEAIALAYS KPISLRVGLP SDIEAALDRQ AGGKSAMGQI VEHLGADEDG
ETDVQHLKDM ASEAPVVRLV NLLIQRAVES RASDIHIEPF ENSLRVRYRI DGVLRDVEAP
PARSTAAVIS RFKIMAKLNI AERRLPQDGR IQLRIQGREL DLRVSTVPTL YGESVVLRLL
DKGNVVLDFD SLGFQGSTLE RFLHVLEQPH GIIVVTGPTG SGKSTTLYTA LHKLNTPDRK
IVTVEDPVEY QLEGINQIQV KPQIGLTFAG ALRSIVRQDP DVIMIGEMRD KETAGIAVQS
ALTGHVVLST LHTNDAAGGV TRLLDMGIED YLLTSTVNGI LAQRLVRTLC KHCSEPYEAL
PELIEELELH RFSQEKPVML HRAVGCEQCN GIGYHGRAAI MEFLVMSDPI RRLVLQHTDA
GEIQKQAQKE GMRIMYEDGL HKALSGFTTI EEVIRVSQES
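
The gene begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the alternative start codons and is read as methionine when it initiates translation. The sketch below is a minimal hand-written translator under that table, not the database's own tooling. The names `translate_table11`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (identical for NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(text):
    """Translate a CDS under table 11, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the gene above, followed by its TGA stop codon.
print(translate_table11("GTGGCGCAAC AACTTGGCAA AACTATTCCT TGA"))  # MAQQLGKTIP
```

The output matches the first ten residues of the protein sequence above (MAQQLGKTIP). The sketch does not handle ambiguous bases such as N.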