Gene Mmar10_2500 details


Gene Information

Locus tag: Mmar10_2500
Symbol:
ID: 4286626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 2721521
End bp: 2722822
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 62%
IMG OID: 638141998
Product: pentapeptide repeat-containing protein
Protein accession: YP_757724
Protein GI: 114571044
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
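
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2721521
end_bp = 2722822

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # gene length in base pairs
print(protein_length, "aa")  # protein length in amino acids
```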


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC GCAAGATCGA GTTCAAGAAA CTGACCCAGG ATGAAGTCGA CGCGATTTGC 
GCCCGGCACG AACGCCTGTG GTCGGGGCGC TCCGGTGGCG CACGGGCCAA TTTCTCCTAC
ACCATTCTCG AAAACATGGT GCTGGCCAAT CGGGACCTGT CCGATGCCGA CTTCAGCGGT
GCCGTGTTGC GCGGTGCCGT CCTGTCCGGA GCCACGCTCA ATAGCGCGGT CTTTTTCGCC
GCCGATTTGC GGCGTGCCAA TCTGGAAGGC GCGTCCCTGA GACGGGCGGA CTTGCGCGGT
GCCATCATGC GGGGCGCCAA CCTAACCGGG GCTGACCTGA CCGAGGCTGA TATGCGCGAA
GGTGCGATCG CCCAAATGGA CAAGGAAAAG GGCCTCGCGG TCCTGACCCA CAAAACCCAG
GAAAACGATG CCGGGACCGC CAATTTCACG GGGGCCAATC TCTCGCACTC GAAAATGGCC
GGAATCGCCG CCCAAAAAGC CGACTTCACT GACGCCATGC TGCTGGGCAC CCGTTTGATC
CGGGCCAATC TGCGCGGATC ATCCTTTCAA GGCGCCAATC TCGAAGGCGC AGACATGTCC
GGCGCGGACC TGTCCGATAC TGATTTCACG CATGCCGTTT TGATCGGCAC GAAGACCGTG
ATGGCCAAGA CCCAGGGCAT GAACACCGAC GGCGCGTTGA CCAATGCGGC GGTCGGTATT
GATCCTGAAG CTCTGGAACA GCCTGTTGCC GACCAAATCG CCGAACATGT CCGCTGGTGT
GAAACCAATG GCAAGGAAGG CAAGCCATCC CTGTTCGACG GGACGGATCT CCGTGGCCTG
ACATCCCTCG CCGGGAAAAC ACTGACCGCC TTTCATGCGC GTGGCGCGAC ACTTTACGGC
CTGGACCTGG AGGGCGCGCA GCTGCAGGGC GCTCGCCTTG AAAAAGCCGA CCTGCGGATG
GTCAGCCTGC GTGGCGCGGA CCTGCGTGGA GCCGATCTTA CCGGCGCCAA CCTTTCAAAT
GCCGACCTGC GTGACTGCCA ACTTGGGCCG CTTCTGCTCG ATGGCAACCG GGTTCTTCCG
GCCTCGCTCG CCGGGGTGCG GGCACGCTAT ACCGATCTGC GGGGCGCCGA CCTGCGTCAG
TTGAAGGCGA CCGGCGCCGA TTTTTCCTAC GCCACCCTCA ACGGGACCAA TGTGAAAAGG
GCGAGCTTCA ATGGCTCCTG CTTCACGGGT GCCCGTATCG ACGAGGACTT CTCCACCCAG
GTTGACTCCC TGGACGGCGC CGTCGATCTC GACGCGGCCT GA
 
Protein sequence
MTDRKIEFKK LTQDEVDAIC ARHERLWSGR SGGARANFSY TILENMVLAN RDLSDADFSG 
AVLRGAVLSG ATLNSAVFFA ADLRRANLEG ASLRRADLRG AIMRGANLTG ADLTEADMRE
GAIAQMDKEK GLAVLTHKTQ ENDAGTANFT GANLSHSKMA GIAAQKADFT DAMLLGTRLI
RANLRGSSFQ GANLEGADMS GADLSDTDFT HAVLIGTKTV MAKTQGMNTD GALTNAAVGI
DPEALEQPVA DQIAEHVRWC ETNGKEGKPS LFDGTDLRGL TSLAGKTLTA FHARGATLYG
LDLEGAQLQG ARLEKADLRM VSLRGADLRG ADLTGANLSN ADLRDCQLGP LLLDGNRVLP
ASLAGVRARY TDLRGADLRQ LKATGADFSY ATLNGTNVKR ASFNGSCFTG ARIDEDFSTQ
VDSLDGAVDL DAA
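
As a rough illustration of how the protein sequence relates to the gene sequence, the Python sketch below translates a nucleotide string codon by codon and computes its GC fraction. It is a minimal sketch, not the method used to produce this record. The helper names `clean`, `translate`, and `gc_content` are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for internal codons; alternative start codons are not handled here.

```python
# Build the codon table in TCAG order (standard amino acid assignment,
# which translation table 11 also uses for internal codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGACCGATC GCAAGATCGA GTTCAAGAAA"))
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the result to be compared against the protein sequence and the GC content listed in this record.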