Gene Noc_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3004 
Symbol 
ID3705712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3395949 
End bp3396986 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content49% 
IMG OID637739478 
Productpilus retraction protein PilT 
Protein accessionYP_344976 
Protein GI77166451 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATCG CTGAACTACT TGCCTTTAGT GTCAAGAATT CTGCCTCCGA TTTACACATC 
TCGGCGGGCC TCCCGCCAAT GATCCGAGTA GACGGCGACG TGCGCCGAAT TAACTTACCG
CCTATGGAGC ACAAGGATGT CCATGCCATG ATCTACGATA TCATGAATGA CAAGCAGCGT
AAGGATTATG AAGAATTTCT GGAGACCGAT TTTTCCTTCG AGATCCCTGG ATTAGCCCGT
TTCCGCGTGA ATGCGTTCAA TCAGAACCGC GGGGCAGCGG CGGTATTCCG CACCATTCCC
TCTCAAGTAC TCACCCTTGA AGAACTCGAT GCCCCCGCCG TATTTAAGAA TATCGCCGAT
AATCCCCGAG GGGTGGTGTT AGTGACCGGG CCTACCGGCT CTGGAAAATC AACTACCTTA
GCGGCTATGG TAAATTATAA AAATGAGAAT GATTTTGCTC ATATTTTAAC GATCGAGGAT
CCCATCGAAT TCGTCCACGA AAGCAAAAAA AGCCTGATTA ATCAACGGGA AGTCCATCGG
GATACCCATG GCTTTAGCGA AGCTCTGCGT TCAGCATTGC GGGAAGACCC TGATATTATT
CTGGTAGGTG AGCTGCGAGA TCTTGAGACT ATTCGTCTCG CATTGACAGC GGCAGAGACG
GGGCATTTAG TCTTCGGCAC TTTGCATACT AGTTCTGCGG CAAAGACTAT TGATCGTATT
ATCGATGTGT TTCCTGCTGC GGAGAAAGAC ATGGTGCGCT CCATGCTCTC CGAATCCCTG
CGGGCCGTAA TTTCCCAGAC TCTCCTTAAA AAATTTGGCG GTGGACGGGT TGCTGCCCAC
GAAATCATGA TTGGCAACCC GGCCATTCGT AATCTCATCC GCGAAGACAA GATCCCACAA
ATGTATTCTG CTATCCAAAC GGGCCAAGAG GCAGGTATGC AGACTCTGGA TCAATGTCTG
TCAGGGTTAC TTCGTCGCAA TATCGTCACT AAGCAGGAGG CAGCTAAGAA AGCCGTAAGC
AAAGAGCTTT TCCAGTAG
 
Protein sequence
MDIAELLAFS VKNSASDLHI SAGLPPMIRV DGDVRRINLP PMEHKDVHAM IYDIMNDKQR 
KDYEEFLETD FSFEIPGLAR FRVNAFNQNR GAAAVFRTIP SQVLTLEELD APAVFKNIAD
NPRGVVLVTG PTGSGKSTTL AAMVNYKNEN DFAHILTIED PIEFVHESKK SLINQREVHR
DTHGFSEALR SALREDPDII LVGELRDLET IRLALTAAET GHLVFGTLHT SSAAKTIDRI
IDVFPAAEKD MVRSMLSESL RAVISQTLLK KFGGGRVAAH EIMIGNPAIR NLIREDKIPQ
MYSAIQTGQE AGMQTLDQCL SGLLRRNIVT KQEAAKKAVS KELFQ