Gene Oter_3083 details

Gene Information

Locus tag: Oter_3083
Symbol:
ID: 6207185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Opitutus terrae PB90-1
Kingdom: Bacteria
Replicon accession: NC_010571
Strand:
Start bp: 4015687
End bp: 4017339
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 66%
IMG OID: 641692748
Product: type II secretion system protein E
Protein accession: YP_001819963
Protein GI: 182414897
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
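
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information record.
start_bp = 4015687
end_bp = 4017339

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1653 0 550
```

Under these assumptions the result agrees with the listed gene length of 1653 bp and protein length of 550 aa.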


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0750364
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAAC GAGGCCTCCT GTCAAAAGCG CAGACCTCCA CCGCGTGTGC GTCCGCCGCA 
GCGGTGGAGC GGTTGGCGGG GCAGTTCGGC CTGCCGATCG CGAGACTTTC CGCGGAGGTT
CGGATCGCGC CCGAGGTGCT GGCGCTGGTG CCGCGGGTGT TTGCAGCGCG CCATGGATTG
CTGCCGCTCG CGGCCGACGC GGAGAGTCTG CAGGTCGTGG TGAGTGATCC GCTCGCGACC
GCGGGCGTTG ACGAACTCAG CCAGCAACTC GGTCGTCGCA TCGACATCGC TTTGGCCACG
TCGGCCCAGA TTGCCGAAGC GATCGAACGA TGCTATGGCT CCGCTCCCGC CACGAGCCAC
GGCGAAGATC AACACGACGA ACCCCGCCTC GAAGTCGCCA GCGCCACGCC GACACCGACC
ACGATGACTG CAGGTGGAGC GACGGCGATG GCGGATGACG AGGCCGATGC GCCGATCATC
CGCTTCGTGC AGTCGATCAT TGCGCAGGCG GTGCGGCGCC GAGCCTCGGA TATTCATTGG
GAGCCGCTCG AGCGGCGTTT TCGCGTTCGT TACCGGATCG ACGGGGTGCT GGTCGAAGTG
GAGAACCCGC CCAAGCGACT GCAGCTCGCC GTGGTTTCGC GGATCAAAAT CATGGCGAAC
ATCTCGATCG CCGAGAAACG CCTGCCGCAG GACGGCCGCA TCGCGATCAC GATCGACGGG
CAGGCGCTGG ATTTGCGCGT CTCGACCTTG CCCACGGCGC ATGGCGAAAG CGTGGTCATG
CGGATCCTGG ACAAGGCGAG CCTGCGTCGC GGGCTGCCGG AGCTCGGTTT CGCGGCGGAT
GATCAGGCGC GGTTCGAACG CCTGCTCGCG TTGCCCGATG GGATGGTGCT CGTCACCGGG
CCGACGGGCT CGGGCAAGAC AACGACGCTG TATGCCTGCT TGCAGCGGCT GAACCTGCCG
GATCGCAAAC TCATCACCGT CGAGGACCCA ATCGAGTATC AACTCACCGG CATCAATCAG
GTGCCGGTGC GTCCGGAGGC AGGAGTCACC TTCGCGGCGG CGCTGCGCGC GATCCTGCGG
CAGGCGCCGA ATATCGTCAT GATTGGCGAA ATCCGGGATC TGGAGACAGC GGAGATCGCG
ATCCACGCCG CGCTTACCGG GCACCTGGTG TTTTCGACGT TGCACACCAA CGACGCCGTC
GGCGCGGTGA CCCGGCTGAT CGATCTAGGG GTGAAACCAT TTCTGGTGGC GAGCGCGCTG
CGGGCGGTTT TGGCGCAGCG GCTGGTTCGA AAAACCTGTG AAAAGTGCGC GCACCCGTAT
GAGCCGCCAA CGGCGTTTCT GGCCGCGCTG AACATCCCGC CGGCAGTCGC GTCCGCGGCT
CGCTTCCGCC GCGGGACGGG CTGCCCGGCG TGCCTGCGCA CCGGTTATCG CGGACGGACG
GGCATTTTTG AACTGTTGGA AATCGACGAT GAGCTGCAGC GCCTGATCCA TGCGCGCAGT
CGGACGGCGT TGCTGCGGGC CCATGCGCGT GCCGCCGGCA TGCGAAGCTT GGCCGAGGAC
GGAGCGCGGC AGGCCAGCGC TGGTTTGACC ACCGTCGAGG AAGTCGTCTC GATCACGGTC
GGTGACGCCA GCCAAGGTCC AACCTTCCAA TGA
 
Protein sequence
MPERGLLSKA QTSTACASAA AVERLAGQFG LPIARLSAEV RIAPEVLALV PRVFAARHGL 
LPLAADAESL QVVVSDPLAT AGVDELSQQL GRRIDIALAT SAQIAEAIER CYGSAPATSH
GEDQHDEPRL EVASATPTPT TMTAGGATAM ADDEADAPII RFVQSIIAQA VRRRASDIHW
EPLERRFRVR YRIDGVLVEV ENPPKRLQLA VVSRIKIMAN ISIAEKRLPQ DGRIAITIDG
QALDLRVSTL PTAHGESVVM RILDKASLRR GLPELGFAAD DQARFERLLA LPDGMVLVTG
PTGSGKTTTL YACLQRLNLP DRKLITVEDP IEYQLTGINQ VPVRPEAGVT FAAALRAILR
QAPNIVMIGE IRDLETAEIA IHAALTGHLV FSTLHTNDAV GAVTRLIDLG VKPFLVASAL
RAVLAQRLVR KTCEKCAHPY EPPTAFLAAL NIPPAVASAA RFRRGTGCPA CLRTGYRGRT
GIFELLEIDD ELQRLIHARS RTALLRAHAR AAGMRSLAED GARQASAGLT TVEEVVSITV
GDASQGPTFQ
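
The gene and protein sequences above can be compared by translating the nucleotide sequence directly. The following is a minimal sketch in plain Python, not the database's own pipeline. It assumes the blocks of ten letters are only a display convention, and it relies on translation table 11 sharing its amino-acid assignments with the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence as displayed in this record.
first_row = "ATGCCGGAAC GAGGCCTCCT GTCAAAAGCG CAGACCTCCA CCGCGTGTGC GTCCGCCGCA"
print(translate(first_row))  # MPERGLLSKAQTSTACASAA
```

The translated first row matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.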