Gene MCA1053 details


Gene Information

Locus tag: MCA1053
Symbol:
ID: 3104049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1103820
End bp: 1105580
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 63%
IMG OID: 637170237
Product: TPR domain-containing protein
Protein accession: YP_113528
Protein GI: 53804652
COG category: [N] Cell motility
              [R] General function prediction only
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
        [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
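
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1103820
END_BP = 1105580
REPORTED_GENE_LENGTH_BP = 1761
REPORTED_PROTEIN_LENGTH_AA = 586


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_length = span_length(START_BP, END_BP)
print(gene_length, protein_length(gene_length))  # prints: 1761 586
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.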


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.038306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTAAAA TTCGGATACG AAAGTTCTGG CCCAGTGGTG CAGGTACTGT GCCAGGTGTG 
CTGTTGATTG CCCTGGCACT CGGGTGCGTG GGCCACAGGG CCAAATTTGC GGACGAAACG
TTCTCGAGCG ATGAGGCCGA GATCCGTTCC GCGGTCGAGC CGCTGCCGCC CAAAGCTCAG
CTCGTCTACC TGGTGCTGGC CGGGGAGTTG GCCGGCCAGC GCGGCCGCTA CGAGGTTGCG
CTCGAGCATT ATCTCCAGGC CGCGCGCCTG TCGCGGGACG GGCGCCTTGC GGAACGGGCG
ATGCAGATCG CCCTGTTTAT AAAAAAATAT CCCGAAGCGG TCGAAAGCGT GGCGCTCTGG
CTGAAGGCCG AACCCCGCCA CGCGGGGGCC CGCCGCATGG CGACCCTGCT CTACCTGAAA
GAAGGACGGC GTGACGAGGC GGTGACACAG ATGAAAGTGT TGCTGACGCT GCCGGACGCC
GATCTGGAAA ATACGCTGAT CGAGTTGGTG AAGGTGCTCG GCAACGAGGT GCCCAGACAG
GATGCAACGG AATTCATGGA CGCCCTGTTG CGGGCATTTC CTGCCATGGC GGATCTCCAT
TTCGCAGCCG CCCTTCTCGC CGCCAACCAG GGCGAGTTCC AGCAGGCTCT GAGCGAAACC
GAGGAGGCCC TGAAGCTGCA TCCGGACTGG GGCCGGGCCC GAGTACTGCA GGCACAGGTC
ATGGCGCAGA TGGGTGATTC GGCGACTGCC GGGGACCTGA TACAGCGCGC GCTCAAGCGC
GATCCGGACA ATGCCAGGCT GCGCCTGATC TACTCTCAGT TTCTCATCAA GTCCGGTGAC
ATCGAAGGGG CGCGGCGGGA GCTGGAGCGT ATCGTAGCCA AGGAGCCCGG CAATCAGGAC
GCCCGGTTCG GACTCGGATT GGCGCTCATC GATCTGGGCC GGCTCGATGC GGCCCGCCGC
GAGTTCGCGG CGCTGGCCGC GTCCGAAAAA TGGCGGGTTC AAGCCTACTT TTATCTGGGG
CTGATCGATG CCCGCAAGGG CAGATTGAAT GAGGCGCTGG ACTGGTTCGA CCGCGTCACG
ACCGGTCCGA CCGAGTTCGA TGCCCGGGTG AACGGCATTA CCGTCCTGAT CAGCCTGGGC
CGTTTGACAG AGGCGCGAAC CCGGCTCGCC GACATCCGCC GGCGTTTTCC GAACGAATCG
GTTCGCCTGT ATCTCCTGGA GGCCGAGCTG CTTTCCAAGA ACCGAGACTA CGAAGATGCC
TTCAATCTGT TGACCGATGC GCTCGGCGAG AATCCGGGGC AGAGCGATCT GCTCTATGCC
CGGGCTCTGG TGGCGGAGAA CCTCGGTCGC TTCGACGTCC TGGAAGCGGA TTTGCGCCAG
GTGCTGGAAA AGAGCCCCGA TGATCCCAAC GCGCTGAACG CCTTGGGTTA CACGCTCGTC
GAGCGGGGCG AACGGTTGGA CGAGGCCAAG GGCTATCTCG ATCGGGCGAT CCGGCTCAAG
CCCGATGACC CGGCGATACT CGACAGTTAC GGCTGGCTGC TGTACCGGCT GCGCAAGTAT
GCCGAAGCCA TCGAATACCT CCGCCGGGCC TATGACAAGG TTCAGGATCC AGAGATCGCA
TCGCATCTGG GTGAGGTTCT GATGGAGTCA GGCCGGCGTC AGGAAGCCCG GAAAATCCTG
CGCGAGGCAT GGAAGAAGGC GCCCGAGCAT GAGGACATGC AGCGGATCAG GGCGCGCTAT
CCGGAACTGC TGGCGCCGTG A
 
Protein sequence
MPKIRIRKFW PSGAGTVPGV LLIALALGCV GHRAKFADET FSSDEAEIRS AVEPLPPKAQ 
LVYLVLAGEL AGQRGRYEVA LEHYLQAARL SRDGRLAERA MQIALFIKKY PEAVESVALW
LKAEPRHAGA RRMATLLYLK EGRRDEAVTQ MKVLLTLPDA DLENTLIELV KVLGNEVPRQ
DATEFMDALL RAFPAMADLH FAAALLAANQ GEFQQALSET EEALKLHPDW GRARVLQAQV
MAQMGDSATA GDLIQRALKR DPDNARLRLI YSQFLIKSGD IEGARRELER IVAKEPGNQD
ARFGLGLALI DLGRLDAARR EFAALAASEK WRVQAYFYLG LIDARKGRLN EALDWFDRVT
TGPTEFDARV NGITVLISLG RLTEARTRLA DIRRRFPNES VRLYLLEAEL LSKNRDYEDA
FNLLTDALGE NPGQSDLLYA RALVAENLGR FDVLEADLRQ VLEKSPDDPN ALNALGYTLV
ERGERLDEAK GYLDRAIRLK PDDPAILDSY GWLLYRLRKY AEAIEYLRRA YDKVQDPEIA
SHLGEVLMES GRRQEARKIL REAWKKAPEH EDMQRIRARY PELLAP
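
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which GTG is one of the permitted alternative start codons and the initiating codon is read as methionine. The following is a minimal sketch of that translation in plain Python. The helper `translate_cds` is hypothetical, not a database or library function, and it assumes the input is a complete coding sequence in the reading frame shown above.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any permitted start codon initiates with methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence in this record.
print(translate_cds("GTGCCTAAAA TTCGGATACG AAAGTTCTGG"))  # MPKIRIRKFW
print(translate_cds("GTG CCGGAACTGC TGGCGCCGTG A"))       # MPELLAP
```

The first call reproduces the opening ten residues of the protein sequence. The second call prepends the start codon to the final sequence line purely for illustration and reproduces the closing residues PELLAP before the TGA stop codon.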