Gene Mmc1_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmc1_3303 
Symbol 
ID4480667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMagnetococcus sp. MC-1 
KingdomBacteria 
Replicon accessionNC_008576 
Strand
Start bp4116218 
End bp4119514 
Gene Length3297 bp 
Protein Length1098 aa 
Translation table11 
GC content53% 
IMG OID639724053 
Producttetratricopeptide domain-containing protein 
Protein accessionYP_867198 
Protein GI117926581 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA TAACGTTTTA CTCTTTTAAG GGGGGTGTGG GCCGCACTGC ACTGCTGATG 
CATTTGGCGA TACAGTGGGC GGACCGGGGG GCTATTGTGG CGGTGGTGGA TATGGATCTT
CAGGCTCCTG GTCTGAGCTT TCATCCATGG CTGCAAGCCC ACAGCGACCC TAAACTGCAA
GAGCGTGGAA TGAGTGATCT ACTCTCCATC TACTACAGTG AGCAAGCACT TGAGCAGGAG
CCCCCTCTCT TTGGTTTTTA CCCCCCCAGC AAGCTTTTGC GAGAGGTCCG CGACCCCAGC
GGTAAATGGG GCAGTGGTGG CCGGCTCTAT GCCCTGCCCG CAGGGGGTCT AAGCCTACCC
CAAGCGGACC ACTATGAGCC CACGGCTGAT CATCATTTTC CAGCAGACAC TGACCGTGCA
GAGCGCACAA CGCCCCCCCC CGCCCAACGC ACGCTACACG CCTTTGCGGA CCGTTTTCGA
AAAGATTTGG AGCAATTTAA AACCCCAGAC AGCAAACGCC CCATTGACCT GCTCCTCATC
GACAGCCGTA CCGGTTATCT GGAGATGGAA GATCTGGCTG TGGGGTATCT GGCCGATCAA
GTTGTGCTCA TCTCTGGCCT AAATGGTCAA AACCTGCGTG GCCTAGCGCT TACCCTAAAG
GCTTTAAGAC CCCAACGCAT TCCTGCGGGG CAATTTGCCA ACCAGGCCCT TTGTGTCTTC
TCCCCGGTAC CAGCCCACTT GCATGACGAC CCCGCAGCCC TGGCGGCGTT ACAAACCGGT
TATGATCTGC TGGAAGAGGC GTGCATCCAA GAGGAGGGGC TGCAAGAGGA GCTACCCCCT
AAAGTATTCA CCCTACCCTA CACACCCCAT TTGGCCATAT CGGACATCCC TCTGCCCCGT
GGCAGTTCGC CTGCCGGAGC ACACCCTTAC TGGCAGAGTG TGGTGGAGAT CGCCAATGCG
TTACTGCCCC AGCAGCAGCC TGAGCAGTTA ATGGCGGAAC AACGCCAGGA GACAATCCGT
ACACTGGGCT TACATGCGCA CACCCCACGC GAGCCACGGA GCCAAGAGGG GCCTTATATG
GGGCCGCCAC TAAAACCCAG CTCTTCCAAC AACCGACCCT TGGCCAACCT TTTCCGACTG
CCTAAGTGGG ACTGGCCCTT TAAAAATTCT CCTGACCTTA AACGTACATG GCTCAGCAAG
GTGCCCTCAA GTAAGGGGCA CACAAAGCAG TTTGAACCTC TGCTCAATGG ACTTTGCATC
TCTCTCTCTT TGAAAATCGA AGAGAAGAAG AAAATCTTTG ATAATTTAGA CAATCTAAGC
CTGTACCAAC AAACAGAACT TGTGAAAATA TTCCAAGAAG AGCAGTACAA ACTTTCATCT
CTTGACGTAA AGCATGAAGA GCAACTCTTC ACCCAATTAT TTGAACATCA AAGAGCATGG
GCTGGCCTTA TGCTAGAGAG CCCAACGGCG GGGGATAGGG CGCTTTTAGA GTGGCCTTTA
GAGGGCACCG ATGCCTTTAT TGCTTGGCGG GAGACACCCT ACTACTGGCG GGCTATTCTG
GAAAATGTCA TCCATGGAAG CGATCTACTT AAAGAGGCGC GGCACGCTAC CCTACAGGCT
GGCATAAAAA GGGCAGGCCA ATATATACAC AAGCAATGGA AACCCCATTC TTTGCAGGGG
GTTTTATCGC TGATTGAATT AATTCACCAG CATGCCCCTG ACTTGACCAC CCCCCTGTTA
GAACGCGTAA AAACGCTGGC CGACTTGGAC CATACAGGCA CGGCTTGGCG GAGTGTTGCC
CAGTTCTACC AGCATGTTAT TCGATGTGAT GGCTCCGCCA GCTCAGCTTA TGAACAAGCG
CTGGCAAAAA ATCCGCAGGA TGCGTGGACA GCGGCTGATT TTGCACAGTT TTTAGCCCAA
TCAGGCAAAG ACCTAGAACG GGCCGAAGCG CTTTACCAGC AAGCCATCGA GGTCGACCCG
AATGACGCCG GCATCCTCAA CAACTTTGCC CTCTTCATGA CCGATAAAAA GGGTGATCAT
GCGCAGGCTG AAATTCTCTA CAACAGAGCC ATCGAGGCCG ATCCAAATGA CGCCATCGCC
CTAGGCAATT TTGCTCACTT CATGACCAAG ATTAAGAGCG ATCATGCGCA GGCTGAAATT
CTCTTCAACA GAGCCATCAA GGCCAATCCG AATCACGCCA AAGCCCTAGG CAATTTTGCT
ACCGTCATGA CCAAGATTAA GAGCGATCAT GCGCAGACTG AAATTCTCTT CAACAGAGCC
ATCGAGGCCG ACCCGAATGA CGCCAAAGCC CTAGGCAATT TTGCTACCTT CATGACCAAT
ATAAAGGGGG ATCATGCGCA GGCTGAAATT CTCTTCAACA GAGCCATCGA GGCCGATCCG
AATAATGCCA ACAACCTAGG CAATTTTGCT CACTTCATGA CCAATATAAA GGGGGATCAT
GCGCAGGCTG AAAGACTCTA CAACAGAGCC ATCGAGGCCG ATCCGAATCA CGCCAACAAC
CTAGGCAATT TTGCCCTCTT CATGACCAAT ATAAAGGGGG ATCATGCGCA GGCTGAAATT
CTCTTCAACA GAGCCATCGA GGCCGATCCG AATCACGCCA ACAACCTAGG CAATTTTGCT
CACTTCATGA CCGATAAAAA GGGGGATCAT GCGCGGGCTG AAATTCTCTA TACCAGAGCC
ATCGAGGCCG ATCCGAATAA CGCCAAAATC CTCAACAACT TTGCTAACTT CATGACCTAT
ATAAAGGGGG ATCATACGCA GGCTGAAATT CTCTACAACA GAGCCATCGA GGCCGCTCCA
AATAACGCCA ACGCCCTAGG CAATTTTGCC CTCTTCATGA CCAATATAAA GGGGGATCAT
GCGCAGGCTG AAATTCTCTT CAACAGAGCC ATCGAGGCCG ATCCGAATCA CGCCAACAAC
CTAGGCAATT TTGCTTGGTT TTTGCTCGGT CGCGGACGCC TTGAAGAGGG ACGCCTCCAA
TTAGAAAAAT CGCTCTCGCT CCAACAAGAA GAGGCGGATC CTACCCTGCA AGCTGAGACC
CGCTTTTATC AATACGCTCT TGGCCCGGCC CACCAGCGCA GCGAAGCCCT GGCTGTACTG
CGGCGGGTAC TCATGGAGAA ACAAGGGCGC TCTCCAGGGT GGGATTTTGC TGGCATCCTT
GAACAAGCTG AGCAAAGGGG CCACCCCCAT TACCCGTGGT TAATCCGCTT GGCCCAGGTA
ATCTCTGCCG GTGCACCCGT AGAAACCCTA GAGCCATGGC CCGACACCAC GACTTGA
 
Protein sequence
MKIITFYSFK GGVGRTALLM HLAIQWADRG AIVAVVDMDL QAPGLSFHPW LQAHSDPKLQ 
ERGMSDLLSI YYSEQALEQE PPLFGFYPPS KLLREVRDPS GKWGSGGRLY ALPAGGLSLP
QADHYEPTAD HHFPADTDRA ERTTPPPAQR TLHAFADRFR KDLEQFKTPD SKRPIDLLLI
DSRTGYLEME DLAVGYLADQ VVLISGLNGQ NLRGLALTLK ALRPQRIPAG QFANQALCVF
SPVPAHLHDD PAALAALQTG YDLLEEACIQ EEGLQEELPP KVFTLPYTPH LAISDIPLPR
GSSPAGAHPY WQSVVEIANA LLPQQQPEQL MAEQRQETIR TLGLHAHTPR EPRSQEGPYM
GPPLKPSSSN NRPLANLFRL PKWDWPFKNS PDLKRTWLSK VPSSKGHTKQ FEPLLNGLCI
SLSLKIEEKK KIFDNLDNLS LYQQTELVKI FQEEQYKLSS LDVKHEEQLF TQLFEHQRAW
AGLMLESPTA GDRALLEWPL EGTDAFIAWR ETPYYWRAIL ENVIHGSDLL KEARHATLQA
GIKRAGQYIH KQWKPHSLQG VLSLIELIHQ HAPDLTTPLL ERVKTLADLD HTGTAWRSVA
QFYQHVIRCD GSASSAYEQA LAKNPQDAWT AADFAQFLAQ SGKDLERAEA LYQQAIEVDP
NDAGILNNFA LFMTDKKGDH AQAEILYNRA IEADPNDAIA LGNFAHFMTK IKSDHAQAEI
LFNRAIKANP NHAKALGNFA TVMTKIKSDH AQTEILFNRA IEADPNDAKA LGNFATFMTN
IKGDHAQAEI LFNRAIEADP NNANNLGNFA HFMTNIKGDH AQAERLYNRA IEADPNHANN
LGNFALFMTN IKGDHAQAEI LFNRAIEADP NHANNLGNFA HFMTDKKGDH ARAEILYTRA
IEADPNNAKI LNNFANFMTY IKGDHTQAEI LYNRAIEAAP NNANALGNFA LFMTNIKGDH
AQAEILFNRA IEADPNHANN LGNFAWFLLG RGRLEEGRLQ LEKSLSLQQE EADPTLQAET
RFYQYALGPA HQRSEALAVL RRVLMEKQGR SPGWDFAGIL EQAEQRGHPH YPWLIRLAQV
ISAGAPVETL EPWPDTTT