Gene MCA1114 details


Gene Information

Locus tag: MCA1114
Symbol: xpsE
ID: 3103952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1169534
End bp: 1171291
Gene length: 1758 bp
Protein length: 585 aa
Translation table: 11
GC content: 64%
IMG OID: 637170299
Product: general secretion pathway protein E
Protein accession: YP_113584
Protein GI: 53804537
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02533] general secretory pathway protein E
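
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length is consistent with that reading) and that the final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1169534
END_BP = 1171291


def span_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


gene_len = span_length(START_BP, END_BP)
print(gene_len)                  # 1758, matching the reported gene length
print(protein_length(gene_len))  # 585, matching the reported protein length
```

Both results agree with the values listed in the record, which suggests the coordinates, gene length, and protein length are mutually consistent.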


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCAGG CAGTATCGAG AAAGACCGAG GCGGAACCGC TCGTGCCCGC CGAGTTGTTG 
CAGGATCCTG ACGGCTATGG CCGATTCGCC GATGCCCTGC TTCGGCGCGG CAAGGTCCGC
GAAATGGATC TGGCAAGGGC GAAGCGCCTC GCGGCACAGG CGGACGAGCT GCGCCTCCCC
GCCTTGCTGG TCAAGCTGGG CGTGGTCTCG GAGCGTGACG TGGCCGAGAC CCTGGCCGAA
GCCAGCGGTC TTCCCCTGAT CGGTCCCACC GATTATCCCG ACGTGTCGCC CTTGCCGGAA
GGCATCGCGT CCCGGTTTCT CAAGGATCGT CATGCCGTCG GCATCGCGGC CAGGGCCGAC
GGTTTCGTCG TGGCGGTGGA GGATCCGTTC GATGCCGAGC TGATCCATGC GCTCGGGCTG
GCCTGTGGTC AGCCGGTCCA TCCGGTGGTC GGTCTGGCCT CGGAGATCGA CCGGGCGCTG
GAGCAGCAGA TCGGCTCGGG CCGTTCGGTG ATGGGCCAAA TCGTCGAAAA CCTGGGCGGC
GACGAGGACG CCGACGAGGC CGACGTCGAA CACCTGAAGG ATCTCGCCAG CGAAGCGCCG
GTGATCCGCA TGGTGAACCT CATCATGCAG CGCGCGGTGG AATCGCGCGC CTCGGACATC
CATGTCGAGC CCTTCGAGCA GACGCTCAAG GTACGGTTCC GCATCGACGG CGTGTTGCGC
GATGTGGAAG CGCCGCCGGT ACGCTCCACC GCGGCGGTGA TTTCGCGCAT CAAGATCATG
GCCAAGCTCA ACATCGCCGA GCGGCGCCTG CCGCAGGACG GCCGGATCAA GCTGCAGGTG
CAGGGGAAGG AGCTGGATCT GCGCGTTTCC ACCGTGCCCA CGATGTACGG TGAGAGCGTC
GTGATCCGGC TTCTGCACAA AGAGAGCATC ACCTTCGATT TCGGCACGCT GGGATTCGAT
GGTTCGGTGT TGCGGCGTTT CCTCGAGATC CTCGAGCTAC CCCACGGTAT CATCCTGATC
ACGGGACCCA CCGGCAGCGG CAAAAGCACC ACGCTGTATA CCGCCCTGCA CAAGATCAAC
ACGCCTTCGC GCAAGATCAT CACGGTCGAG GACCCGGTGG AATACCAGCT GGAAGGCGTC
AACCAGATCC AGGTCAAGCC CCAGATCGGT CTGAATTTCG CGAGCGCGCT GCGCTCCATC
ATGCGTCAGG ACCCGGATGT GATCATGATC GGTGAGATGC GCGATCTGGA GACGGCCCGT
ATCGCCGTGC AATCGGCGCT GACCGGCCAC CTGGTACTGT CCACGCTGCA CACCAATGAC
GCCGCCGGCG GCGTGACCCG TCTTTTGGAC ATGGGGCTCG AGGACTATCT CATCACCTCG
ACCGTGAATG GTATTCTCGG CCAGCGTCTG GTGCGGCGGC TGTGTCAGAG TTGCCGCGAA
CCACATCCGG CGCTGGAGGA GGTCGCGGAA GAAATGGGGC TGCGGCGGTT CCAGCGTGAC
GGCGAGGTGG TGCTGTACCG GCCGGTCGGC TGCGAACAAT GCGGCGGCAC CGGTTTTCGT
GGACGGCTCG CGATCCTCGA GTTCCTGGTC ATGTCCGACG AGGTGCGGCG GTTGGTGATG
AGTCACGCCC AGGCGCGGCA GATCGAGGAG GTCGCCCTAC GCGAAGGCAT GCATACCATG
TATGACGATG GTGTCCGCAA GGCTTTGATG GGGCTGACCA CCGTCGAAGA GGTCCTGCGC
GTCACCTCGG ATTCCTGA
 
Protein sequence
MTQAVSRKTE AEPLVPAELL QDPDGYGRFA DALLRRGKVR EMDLARAKRL AAQADELRLP 
ALLVKLGVVS ERDVAETLAE ASGLPLIGPT DYPDVSPLPE GIASRFLKDR HAVGIAARAD
GFVVAVEDPF DAELIHALGL ACGQPVHPVV GLASEIDRAL EQQIGSGRSV MGQIVENLGG
DEDADEADVE HLKDLASEAP VIRMVNLIMQ RAVESRASDI HVEPFEQTLK VRFRIDGVLR
DVEAPPVRST AAVISRIKIM AKLNIAERRL PQDGRIKLQV QGKELDLRVS TVPTMYGESV
VIRLLHKESI TFDFGTLGFD GSVLRRFLEI LELPHGIILI TGPTGSGKST TLYTALHKIN
TPSRKIITVE DPVEYQLEGV NQIQVKPQIG LNFASALRSI MRQDPDVIMI GEMRDLETAR
IAVQSALTGH LVLSTLHTND AAGGVTRLLD MGLEDYLITS TVNGILGQRL VRRLCQSCRE
PHPALEEVAE EMGLRRFQRD GEVVLYRPVG CEQCGGTGFR GRLAILEFLV MSDEVRRLVM
SHAQARQIEE VALREGMHTM YDDGVRKALM GLTTVEEVLR VTSDS
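
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method the database used: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1 for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE`, `translate`, and `first_block` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position
# varies fastest). Table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bp, 20 codons).
first_block = (
    "ATGACCCAGG CAGTATCGAG AAAGACCGAG "
    "GCGGAACCGC TCGTGCCCGC CGAGTTGTTG"
)
print(translate(first_block))  # MTQAVSRKTEAEPLVPAELL
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way should reproduce the whole protein, stopping at the terminal TGA codon.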