Gene Noc_2272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2272 
Symbol 
ID3705064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2618951 
End bp2619979 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content52% 
IMG OID637738751 
Producttype IV pilus assembly protein PilW 
Protein accessionYP_344260 
Protein GI77165735 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4966] Tfp pilus assembly protein PilW 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGCC CATCTTACTA CTTACCTTCA AGCCGCGCGC AACGGGGTTT CTCACTAGTT 
GAGATTTTAG CGGGGATGAC CGCGGGCTTG ATACTCACTA CGGGGGTTAT TCAAGTCTTT
TCCAGTAGCA AACAAAGCTA CCGGCTTCAG GAAGCGCTCT CCCGACTCCA GGAGAACGGC
CGCTTTGCCG TAGAGTTTAT CTCTTACGAG GCCCGCAATG CCGGCTACTT TGGCTGCGCT
GGAACCGAAA CCCGGGTAGT TAATACCCTC AATAATGCTG AACAATTTGC TTGGGATTTT
TCTACTCCCC TGCAAGGTTT TGAGGCCACC AGCGCCAGCA GCTGGACGCC TGCCCTAGAC
GCTACCATCA CCCAACCCCT TGGTAACCGG GATGTGTTGA CCTTCCGTCA TACTTCCGGC
AACCTGGCCA AAGTTGAACC GCCCTTCATG CCTACTACCT CCGCGGCGCT CCACATAACG
CCTGATAACG GGCTCAAGAA ATCCGATATT GTCATGGTTT CCGACTGTGT CGATGCAGCC
ATTCTTCAAG TAACCAATGC TAACCCAGAC ACCTCCGGCA CTTTAGTCCA TAACACCGGC
AACGGGGTCA CCCCCGGCAA CGCCACTAAA GATCTGGGAA AAAGATATAC CGACAAGGCT
AATATTATCC AAATCACCAC CAGCACCTAC TACATTCGCG CTAATCCGCG AGGCGTTCCC
TCCCTTTATA GGAAAGAGAG CGACGACAAC CCGCAAGAAC TCATCGAAGG CGTGGAGGAT
ATGCAAATTC TCTACGGCGA AGATACGGAT GGCAGCCAGG AGGCCAATGG TTATGTGACT
GCGGATAACG TGGCCAATTG GAATAATGTG GTCAGCCTAC GTCTCAATTT TCTGTTGCAA
ACCATGGAAA ATAATCTCGC CTCCTCTCCC CAATCCTATA CCTTTAATGG CGCCACCATA
ACTCCCAGCG ACAGGCGGCT GCGCCGGGTA TTCACCACAA CCCTGAATTT GAGGAATAGA
ACGTTATGA
 
Protein sequence
MKRPSYYLPS SRAQRGFSLV EILAGMTAGL ILTTGVIQVF SSSKQSYRLQ EALSRLQENG 
RFAVEFISYE ARNAGYFGCA GTETRVVNTL NNAEQFAWDF STPLQGFEAT SASSWTPALD
ATITQPLGNR DVLTFRHTSG NLAKVEPPFM PTTSAALHIT PDNGLKKSDI VMVSDCVDAA
ILQVTNANPD TSGTLVHNTG NGVTPGNATK DLGKRYTDKA NIIQITTSTY YIRANPRGVP
SLYRKESDDN PQELIEGVED MQILYGEDTD GSQEANGYVT ADNVANWNNV VSLRLNFLLQ
TMENNLASSP QSYTFNGATI TPSDRRLRRV FTTTLNLRNR TL