Gene Mmar10_1403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1403 
Symbol 
ID4284635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1538165 
End bp1539661 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content66% 
IMG OID638140885 
Productanthranilate synthase, component I 
Protein accessionYP_756633 
Protein GI114569953 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.539776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.000291394 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCCGC ACGCGGTCAT CACGCCTGAC TTCGCAACCG CTGCGGCCCA GCTGGAGCGG 
GGTGAAACCT GCGTCATACA GGCGCGGCGG GTCGATGACC TGCTGACCCC GGTCGCCGCC
TATCTTCGAC TGGCTGACAG CCAGCCCAAC ACCTTCCTGC TCGAATCGGT CGAGGGCGGC
GCCTGGCGTG GACGCTATTC CGCCATCGGC CTCGACCCAG ACCTGATCTG GCGTTGCCGC
GATGGTGTGG TCAGTGAAGC GCGCGGCATG GACGTCGCCA AACGCCGTTT CTCGCCGATC
GAGGCCGCCC CGATGGAAGC CCTGCGCGAG GTGATCGAAG CGGCCCATTG TCCCCTGCCG
ATCGATGCCC CGCCCTTGGC CTCCGGCCTG TTCGGCTATC TGGGCTATGA CATGGTGCGC
TACCTGGAAC GCCTGCCCGA AGGCGCAGCA CCGGACCCGC TCGGCCTGCC CGAATCCATC
CTGCTGCGCC CGCAGACCAT GGTCGTGTTC GATGCGCTCA AACAGGAAAT CCAGGTCTAC
TGCCCGGTCC GCCCCGGCGA GTACTCAGCG CGCGAGGCTT ATGACGCTGC TGTCGAACGC
CTGCAGACGA CCTTGCAGAA ACTGGCCGGT GCAACGCCCG AAAAGGCGGC GCCAACCGGC
GAGCTGGGCC CGCGCCAATC CAATCGCAGC CCTGACGATT ATCGCGCCGC GGTCGACAGG
GCGCGTGACT ATATCCGTGC CGGTGACGCC TTCCAGGTCG TCCCCAGCCA GCGTTTCTCC
GCCGACTATC CGGCCGATCC GTTCTGGCTC TACCGCTCGC TGCGACGCCT GAACCCCTCG
CCTTTCCTGT TCTTTTTCCG CTTTGACGGG TTCGAAGTGG TCGGCTCCAG CCCGGAAATC
CTGGTGCGGT TGCGCGATGA TGTCGTCACC ATCCGGCCCA TCGCCGGCAC CCGTCCGCGC
GGCAAGACCC CGGCCGAGGA TGACGCCTTT GAGGCCGATC TGCTGGCCGA CCCGAAAGAG
CGCTCGGAAC ACCTCATGCT GCTCGATCTG GGCCGCAATG ATGTCGGCCG AATTGCCAAG
CCCGGCTCGG TCCGCATCAC CGCCCGCGAG ATCGTCGAAC GCTATAGCCA CGTCATGCAC
ATTGTCTCGA ACGTCGAGGG TGATCTGCGC GACGACCAGG ATGTCGTCTC GGCCCTGTTT
GCCGGCTTCC CGGCGGGCAC AGTGTCCGGT GCCCCCAAGG TCCGGGCGAT GGAGATCATC
GATGAGCTGG AGCCACACCG GCGCGGCGTC TATGCCGGGG CGGTCGGCTA TTTCAGTGCG
GGTGGCGGCA TGGATACGGC GATCGCGCTG CGCACGGCGG TGTTCAAGGA CGGGCGCATG
CATGTTCAGG CCGGTGCCGG CGTGGTGCTG GACAGCGATC CGGAATCCGA GCGTGTCGAG
ACCGTCAACA AGGCCGAAGC GCTATTCCGC GCCGCGATTG ATTCCTTCGG CCACTGA
 
Protein sequence
MTPHAVITPD FATAAAQLER GETCVIQARR VDDLLTPVAA YLRLADSQPN TFLLESVEGG 
AWRGRYSAIG LDPDLIWRCR DGVVSEARGM DVAKRRFSPI EAAPMEALRE VIEAAHCPLP
IDAPPLASGL FGYLGYDMVR YLERLPEGAA PDPLGLPESI LLRPQTMVVF DALKQEIQVY
CPVRPGEYSA REAYDAAVER LQTTLQKLAG ATPEKAAPTG ELGPRQSNRS PDDYRAAVDR
ARDYIRAGDA FQVVPSQRFS ADYPADPFWL YRSLRRLNPS PFLFFFRFDG FEVVGSSPEI
LVRLRDDVVT IRPIAGTRPR GKTPAEDDAF EADLLADPKE RSEHLMLLDL GRNDVGRIAK
PGSVRITARE IVERYSHVMH IVSNVEGDLR DDQDVVSALF AGFPAGTVSG APKVRAMEII
DELEPHRRGV YAGAVGYFSA GGGMDTAIAL RTAVFKDGRM HVQAGAGVVL DSDPESERVE
TVNKAEALFR AAIDSFGH