Gene Mmar10_1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1239 
Symbol 
ID4286404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1348917 
End bp1350398 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content61% 
IMG OID638140718 
Productpentapeptide repeat-containing protein 
Protein accessionYP_756469 
Protein GI114569789 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.101838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0181716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGCAT TGATTTCGAT TTCGATCATG GGACTTTCGT TGCTGACCTC GACCGCTGCG 
GTCGAGGCGC AGGCCAATCA GGAGCGGGTG ATCCCCGAAC ATATTCTTGT CCGGGTCATG
CCGCGAGATC TGGTGGGGGT CGATTTGAGC GGCGCCGCGC TGGCAGATGT AAGCTTCGAG
CGGGTTACCA TGTCCGGCGC GAACATGTCC GGCGCAAATC TCAGCCGATC CCGGTTTCCG
GATGCCAATC TGGACCGCGC GGATCTCTCG AATACCGATC TGACCGAGGC TGACCTCTCC
ACGGGAAGAT TTGTCGGCGC AAACTTTCGA GGTGCTTTGC TGCGCAACAC CTCCCTGACC
GGTGCCAATC TCACCGGCGC GGATCTGACA GGGGCCCGGG AACTGGGCTA TGAGATCAAT
CAGGCCCGCC TTTGCAATAC TCGCCTGAGC GCAACAGTGG TCCTCAATCG CGATTGTGTG
GAGCTTGGAG TGCGTGCCGA GACGCGCCAT CTCGTTGATC GCAACGCCCA GCAGCGTCTC
CTGCAAAACC GGGCCTGTGT CGGCTGTGAT CTTCGCTTTG CCCAGGCCGC CAACGCGTCG
CTGATTGGCG CCGATATCTC CCGCGCCCTG CTGACCGATG CCAATCTGTC GGGCGCGAAC
CTGTCTGGGG CCAACCTCGT CGACGCCGAG CTTACCCGCA CCAATTTTGC GCGTGCCAAT
CTGTCGAATG TGAGTTTCTC CGGCCGTCCA TTGCCGCGCA ATGTGATATT CACCGGCACC
ATTCTTCGTG GCGCCAATCT TTCTGGCCAA GACCTCACGG GGCTCGACTT TCAGGGCGCC
GACCTGTCCC GAGCCAATGT CGAGGCGGCC GAAATCGATC GCACGACATT GCTCCGGGAT
GCCCGCCTGA CCGGACTGGA CCTGAGCCAT GCCTCCTTGA GCCAGGCGAT TTTTCCGGGA
AATGACTTTC GCACCATCAA CCTGACCGGT GTGCAGATTT ACGGGATGGT CCTGACCGGC
GCCAATTTCA CCTCTGTCGA GTTGTCGAAT GCCCGGATTG TTGAGAGCAA CATGCAACGG
GCCATCCTTG CCGGTGCCAA TCTGTCCTAC GCCGACCTTT CCGGGATTGA CCTTGCTGGC
GCCGATCTGA CCGGTGCCGA CTTGAGTGGA GCCAACTTGA TCGGCGCTGA TCTGACAGGC
GCCAACCTGA CTCGCGCCAA CCTGACAGGG GCCATCCTGT TCGGGACCGA TCTGACGCGT
GCGATCCTGG CCAATGCCCG GTTGAACTCG GCTCAACTGG TCGGCGCCCA GCTAAGCGGC
GCCCGGCTCG ACTCGGCGGA CCTGACGGAT GCCAATCTTT TTGGCGCGCA GAACGCGGCC
AGCATCCCGG TCTCCGGCAC CATGACCTTC TGCCGCACCA GGATGGCAGA CGGGAGTGAC
CGCAGCTCCA GCTGTGGGGC TGCGGTTTCG TCGCCGAAAT AG
 
Protein sequence
MRALISISIM GLSLLTSTAA VEAQANQERV IPEHILVRVM PRDLVGVDLS GAALADVSFE 
RVTMSGANMS GANLSRSRFP DANLDRADLS NTDLTEADLS TGRFVGANFR GALLRNTSLT
GANLTGADLT GARELGYEIN QARLCNTRLS ATVVLNRDCV ELGVRAETRH LVDRNAQQRL
LQNRACVGCD LRFAQAANAS LIGADISRAL LTDANLSGAN LSGANLVDAE LTRTNFARAN
LSNVSFSGRP LPRNVIFTGT ILRGANLSGQ DLTGLDFQGA DLSRANVEAA EIDRTTLLRD
ARLTGLDLSH ASLSQAIFPG NDFRTINLTG VQIYGMVLTG ANFTSVELSN ARIVESNMQR
AILAGANLSY ADLSGIDLAG ADLTGADLSG ANLIGADLTG ANLTRANLTG AILFGTDLTR
AILANARLNS AQLVGAQLSG ARLDSADLTD ANLFGAQNAA SIPVSGTMTF CRTRMADGSD
RSSSCGAAVS SPK