Gene Mmar10_2603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2603 
Symbol 
ID4285951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2814747 
End bp2816309 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content62% 
IMG OID638142102 
Producthypothetical protein 
Protein accessionYP_757827 
Protein GI114571147 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4961] Flp pilus assembly protein TadG 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.438803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTCC AAGGCCTGTT CAATCTTGGC GCCCGCCTTT GTCGCGAAAC CCGCGGCAAT 
GTCGCGACGA TTTTTGCCCT CACCCTGGTC CCGGTCGCCC TGTTGTCAGG TGGCGCGGTC
GATCTAAGCC AGTCGATGAA TGCCCGCTCG CGCCTGGCCC AGGCCCTCGA CGCAGCCGCC
CTCGCAGTCG GCGTCAATAC CAACCTGTCG AGCTCGGAAG CGACCGGTAT CGCCAATGAT
TTCATCGCCG CCAACTATCC CGGCCGCGAG CTCGGCGTCG TCCAGAACGT CAATGTCTAT
ATCGATGACG AAACCGACAC GGTGACCGTT TCGGGCGAGG CCCGTGTGCG CACCACCATG
CTGGGCATGA TCGGTCTCGA CTACATCACG GTTCACTGGG AGAGCGAGGT CCAGCGCGCC
CGCCAGCGGC TGGAGCTGGT CATGGTACTC GACAATACCG GCTCGATGGG TGGCTCGAAG
ATCCGCAATC TGCGTGAAAG CGCCGAGTTG CTGACCGGCA TCCTTTTTGA TGCGGCTGAC
GACCCGAGCG ATGTGAAGAT CGGCCTCGTC CCCTTCGCTG CCACGGTGAA TGTCGGAACC
AATCATGCGC GCGCCTGGTG GATGGACCCG GATGCCTTGT CCCCGGTCCA TGCCGAATGG
GCAGGCGGCA ATCCGGTCGA GATCGAGACC TGCTCCGGCC GGGGCCGGGG CCGGCGCCGA
CGTTGCCAAA CCGAAGAGAT CTGGGTCAAT CACTGGGACC TGTTCGACCA GCTTCGCAAT
ACCGGGTGGG AGGGCTGTGT TGAAGCTCGG CCGATCCCGA TGGATATCGA CGATACACCA
CCCAGCATCG GCAATCCGAG CACGCTCTTT GTGCCGTATT TCGCGCCGGA CGAGCCGGAT
AACGGCAGCT ATTCCAACAG CTATCTCAGT GACGGTGTCA GTGGCGGTGT CAGCGAGCGA
CTGCAGGCGC TCGACAAATA CGATAATGGC CGACCGAACC GCGAAGGGCC GAACCGGTCC
TGCACCACGA CACCGGTCAC CCCCCTGACT TCGACCGAGC GCACGGTGCT CAACGCAATC
GGCGACATGG GCGCCAGCGG CACGACCAAT ATCCCCAATG GTGTCGGCTG GGGTATCCGG
CTCATCTCGC CCGGCGCGCC CTTCACCGAA GGCTCCGCCT GGGACGATGA TGAATACATC
AAGGCGATGG TGATCCTGAC CGATGGCGAC AATGTCATGA GAGGGCGCAA TACGGACCAA
ATGTCGGACT ATGAAGCCTA TGGATTTGTT GCCGACGGGC GCCTTGGACG TCGGTCTTCG
AGTAGCAATG TCCTGTCCAA CGAACTCGAC GACCGCACCG AAGCAGCCTG CGCCTATGCC
AGGTCACTGG GCATTCGCGT CTACACCATT ACCTTCCAGG TCAATTCCAG CTCCACCCGA
AGCCTGATGC AGAACTGCGC CAGCAATCCG AGCCTGTATT TCGACTCGCC CTCATCGGAA
GCGCTGGAAG ACGCCTTCGA GATGATCGCC GGTGACCTGA CAAACCTGCG CCTGTCGCGT
TAA
 
Protein sequence
MPFQGLFNLG ARLCRETRGN VATIFALTLV PVALLSGGAV DLSQSMNARS RLAQALDAAA 
LAVGVNTNLS SSEATGIAND FIAANYPGRE LGVVQNVNVY IDDETDTVTV SGEARVRTTM
LGMIGLDYIT VHWESEVQRA RQRLELVMVL DNTGSMGGSK IRNLRESAEL LTGILFDAAD
DPSDVKIGLV PFAATVNVGT NHARAWWMDP DALSPVHAEW AGGNPVEIET CSGRGRGRRR
RCQTEEIWVN HWDLFDQLRN TGWEGCVEAR PIPMDIDDTP PSIGNPSTLF VPYFAPDEPD
NGSYSNSYLS DGVSGGVSER LQALDKYDNG RPNREGPNRS CTTTPVTPLT STERTVLNAI
GDMGASGTTN IPNGVGWGIR LISPGAPFTE GSAWDDDEYI KAMVILTDGD NVMRGRNTDQ
MSDYEAYGFV ADGRLGRRSS SSNVLSNELD DRTEAACAYA RSLGIRVYTI TFQVNSSSTR
SLMQNCASNP SLYFDSPSSE ALEDAFEMIA GDLTNLRLSR