Gene Noc_0312 details


Gene Information

Locus tag: Noc_0312
Symbol:
ID: 3706483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 338120
End bp: 339835
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 51%
IMG OID: 637736826
Product: type II secretion system protein E
Protein accession: YP_342370
Protein GI: 77163845
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02538] type IV-A pilus assembly ATPase PilB
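
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one terminal stop codon. Neither assumption is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 338120
end_bp = 339835
gene_length_bp = 1716
protein_length_aa = 571

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
expected_aa = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codon count matches protein length:", expected_aa == protein_length_aa)
```

Under those assumptions both checks agree with the listed values (a 1716 bp span and 571 residues).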


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCACTC CTGGAGTACA GATCATAATG AGCGGTCTGG TCCGCCGTTT AGTAGAGGAT 
GGATTGCTTA GCGAAGCCAA TGCCCGGCAA GCCCAAGAGC ATTCCCGCAA GAACGGTACT
CCTTTAGTGA CTTACCTGGT ACGGCAAAAG CTACTTAAAA GTCTGGATAT TGGTCAAGCA
GCCTCCCAGG AATTCGGCGT CCCCCTCTTC GATATCAATG CCCTGGATTT GGATTATCTT
CCCAAAGGGC TAGTGGAAGA AAAATTGGTT CGCCAACACC AGGCACTACC CTTATTCAAG
CGTGGGAATC GTCTATTCGT TGCCGTGGCC GATCCCACCA ACCTGCAAGC ATTAGACGAA
ATCAAGTTCC ATACGGGAAT TAATACCGAA GCTATCCTTG TTGAAGAAGA CAAACTAGCA
CGAATGATCG ATCGGGCCAT GGAGGCCCAA GACACTTCCT TAGCGGAGCT AAACGATACG
GCACTAGATG ACCTTGACAT CTCCGGGGGT GAAAATGAGC CTCCAGAAAA TACTGCTGAA
TCGGAGGCTG ATGATACCCC AGTAGTTCGT TTTATTAATA AGGTTCTGTT GGATGCTATT
CATCAGGGAG CATCGGATAT TCATTTTGAG CCCTATGAAA AAATCTATCG GGTCCGATAT
CGGCAAGATG GGGTGCTGCG GGAAGTCGCT ACCCCGCCGG TTACTCTGGC CGGCCGGCTA
GCAGCCCGCC TCAAGGTCAT GGCTCGCTTG GATATTTCCG AACGGCGCGT ACCTCAAGAT
GGGCGCATGA AAATGAATAT TTCCAAGAAC CGTGCCATTG ATTTTCGGGT AAATAGCTGC
CCCACCTTAT TCGGCGAAAA AATTGTTCTT CGTATCCTGG ACCCCTCCAG CGCGCAAATG
GGAATCGAGG CACTGGGTTA TGAAGAAAGA CAACAACAAA TATTTCTAGA AACCATTCAC
CGCCCTTATG GGATGGTGCT CGTTACCGGC CCCACGGGTA GTGGTAAAAC GGTATCTCTT
TATACCGCGC TTAATATTCT TAATACCGCG GACCGTAATA TCTCCACTGC CGAAGATCCA
GCCGAAATCA ATCTGCCTGG TATTAACCAG GTCAATGTCT ACCCCAAAGT CGGTTTGACT
TTTGCCGGTG CCCTTAAAGC CTTCTTACGT CAGGACCCCG ATGTCATCAT GGTGGGGGAA
ATTCGTGATC TGGAAACGGC TGAAATTGCT ATTAAAGCTG CCCAAACCGG CCATATGGTG
CTTTCCACCC TTCATACCAA CGACGCGCCC CAAACCCTGA CCCGGCTATT AAACATGGGG
GTGGCGTCCT ATAACATTGC CTCGGCAGTT TCTTTGATCA TCGCCCAACG CCTGGCCCGG
CGTCTCTGCT CCCGCTGCAA GGCACCCGAA AAGATATCCC ACGAAGTGCT GCTAGAAGAG
GGTTTTACCA AAACCCAACT GGAAACCACG CCCACCCTTT ATAAGGCCGC AGGCTGCGAG
CACTGTACAA AAGGCTATAA AGGCCGCGTG GGCATCTATC AGGTCATGCC CGTATCGGAA
GAGATGGGCC GCATCATTAT GGCGGGAGGC AACTCCATGG AACTTGCGGA GCAGTCAAAA
AAAGAAGGCG TCGCCGATCT CCGCCAATCG GGGCTTAAAA AAATTATTGA TGGCGTGACT
AGTATAGAGG AAATCAACCG AGTGACGAAG GAGTAA
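
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks used above; the helper name `gc_percent` is hypothetical, and the example runs on a short fragment of the first line only, not on the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)

# First two 10-base blocks of the gene sequence, used only as a demonstration.
fragment = "ATGGCCACTC CTGGAGTACA"
print(f"GC content of fragment: {gc_percent(fragment):.1f}%")
```

Passing the complete 1716-base sequence to the same function would give the value to compare against the reported figure.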
 
Protein sequence
MATPGVQIIM SGLVRRLVED GLLSEANARQ AQEHSRKNGT PLVTYLVRQK LLKSLDIGQA 
ASQEFGVPLF DINALDLDYL PKGLVEEKLV RQHQALPLFK RGNRLFVAVA DPTNLQALDE
IKFHTGINTE AILVEEDKLA RMIDRAMEAQ DTSLAELNDT ALDDLDISGG ENEPPENTAE
SEADDTPVVR FINKVLLDAI HQGASDIHFE PYEKIYRVRY RQDGVLREVA TPPVTLAGRL
AARLKVMARL DISERRVPQD GRMKMNISKN RAIDFRVNSC PTLFGEKIVL RILDPSSAQM
GIEALGYEER QQQIFLETIH RPYGMVLVTG PTGSGKTVSL YTALNILNTA DRNISTAEDP
AEINLPGINQ VNVYPKVGLT FAGALKAFLR QDPDVIMVGE IRDLETAEIA IKAAQTGHMV
LSTLHTNDAP QTLTRLLNMG VASYNIASAV SLIIAQRLAR RLCSRCKAPE KISHEVLLEE
GFTKTQLETT PTLYKAAGCE HCTKGYKGRV GIYQVMPVSE EMGRIIMAGG NSMELAEQSK
KEGVADLRQS GLKKIIDGVT SIEEINRVTK E
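
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a sketch, not the database's own procedure: it builds the standard codon assignments (translation table 11 uses the same amino acid assignments and differs in which start codons it permits), and the names `CODON_TABLE` and `translate` are introduced here for illustration. Only the first and last lines of the gene sequence are translated.

```python
# Codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# Start of the first sequence line and the whole last line (both in frame,
# since every full line holds 60 bases).
first_block = "ATGGCCACTC CTGGAGTACA GATCATAATG"
last_block = "AGTATAGAGG AAATCAACCG AGTGACGAAG GAGTAA"

print(translate(first_block))
print(translate(last_block))
```

The two outputs can be compared by eye with the first ten and last eleven residues of the protein sequence listed above.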