Gene Noc_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0166 
Symbol 
ID3706199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp183948 
End bp185312 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID637736683 
ProductTPR repeat-containing protein 
Protein accessionYP_342229 
Protein GI77163704 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0362369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCC AAGGACGACT CGATGTGCTA CAGAAGAAAG CTAAGGCAGC ATTAGCAATT 
CGACACTACA AAGAGGCAGA ATCCCTATTA CAAGAGTTAT TAGAAACCCA AGTCCAGCAC
TTTGGTGATG CAGACACGCA GATCGCCACC ACACTGAACA ATCTCGCAGC TCTTTATGAA
GCCCAAGGGC GGTATGCTCA GGCCGAGGAG CTTTACCATC GCTCGCTTGC TATCCGCGAA
CAGTTGCTAG GCCCGGACCA CCCCGAGGTT GCCACCACAC TGAACAATCT CGCAGCTCTT
TATGAAGCCC AAGGGCGGTA TGCTCAGGCC GAGGAGCTTT ACCATCGCTC GCTTGCTATC
CGCGAACAGT TGCTAGGCCC GGACCACCCC GAGGTTGCCA CCACACTGAA CAATCTCGCA
GCTCTTTATG AAGCCCAAGG GCGGTATGCT CAGGCCGAGG AGCTTTACCA TCGCTCGCTT
GCTATCCGCG AACAGTTGCT AGGCCCGGAC CACCCCGAGG TTGCCACCAC ACTGAACAAT
CTCGCGGCGC TCTATAAGAA ACAAGGGAGG TACGCTCAGG CCGAGGAGCT TTACCATCGC
TCGCTTGCTA TCCGCGAACA GTTGCTAGGC CCGGACCACC CCGAGGTTGC CACCACACTG
AACAATCTCG CAGCTCTTTA TGAAGCCCAA GGGCGGTATG CTCAGGCCGA GGAGCTTTAC
CATCGCTCGC TTGCTATCCG CGAACAGTTG CTAGGCCCGG ACCACCCCGA GGTTGCCACC
ACACTGAACA ATCTCGCAGC TCTTTATGAA GCCCAAGGGC GGTATGCTCA GGCCGAGGAG
CTTTACCATC GCTCGCTTGC TATCCGCGAA CAGTTGCTAG GCCCGGACCA CCCCGAGGTT
GCCACCACAC TGAACAATCT CGCGGCGCTC TATAAGAAAC AAGGGAGGTA CGCTCAGGCC
GAGGAGCTTT ACCATCGCTC GCTTGCTATC CGCGAACAGT TGCTAGGCCC GGACCACCCC
GAGGTTGCCA CCACACTGAA CAATCTCGCA GCTCTTTATG AAGCCCAAGG GCGGTATGCT
CAGGCCGAGG AGCTTTACCA TCGCTCGCTT GCTATCCGCG AACAGTTGCT AGGCCCGGAC
CACCCCGAGG TTGCAATCAT GCTAAATAAT CTTGCTGGCT TGTACAGGGC GACGGGATTG
GGTGAGAAAG CAGAAAGTTT GTATGACAGA AGCTTGGCGG TAATGGAAAA AATATTCGGG
CCAAGACATC CAAATACTGC AATCGTACGA GCCAACCGCG ATGCTTATAA ACATACGGCA
CCTAACAAGG CAAATTCAGC CGACGCCAAA AAGCGGCGCG GCTGA
 
Protein sequence
MKLQGRLDVL QKKAKAALAI RHYKEAESLL QELLETQVQH FGDADTQIAT TLNNLAALYE 
AQGRYAQAEE LYHRSLAIRE QLLGPDHPEV ATTLNNLAAL YEAQGRYAQA EELYHRSLAI
REQLLGPDHP EVATTLNNLA ALYEAQGRYA QAEELYHRSL AIREQLLGPD HPEVATTLNN
LAALYKKQGR YAQAEELYHR SLAIREQLLG PDHPEVATTL NNLAALYEAQ GRYAQAEELY
HRSLAIREQL LGPDHPEVAT TLNNLAALYE AQGRYAQAEE LYHRSLAIRE QLLGPDHPEV
ATTLNNLAAL YKKQGRYAQA EELYHRSLAI REQLLGPDHP EVATTLNNLA ALYEAQGRYA
QAEELYHRSL AIREQLLGPD HPEVAIMLNN LAGLYRATGL GEKAESLYDR SLAVMEKIFG
PRHPNTAIVR ANRDAYKHTA PNKANSADAK KRRG