Gene Mmwyl1_2967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_2967 
Symbol 
ID5365839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp3351694 
End bp3353118 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content44% 
IMG OID640805340 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001341813 
Protein GI152996978 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0339276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCAA TCAAACACGT AAATCTCCCG TATCAGGCGT CATTACTGTC TTTCTTTTCT 
GCTGTGCGAG ACTTACCTTA TCCCGCTTTA CTCGACAGTA ATCACGAACA TTTCCCTGAC
ACAAATTACG ACATTTTAGT CGCTAACCCG CTTGCTCGCA TCCATCCAGT TCAAAATAAC
CAGCCGATCA CTTGGTACAG CAAACCACTT TATGAGCTAA GCAATACTGA CAACCCAATG
ACCTTGTTGA ATGAATTAAT GAGCCTTATT TGCCAAGAAC CTTGGGCGAA ATCTGCTCCG
AAAGATTTAC CTTTTGTTGG CGGTTTACTG GGCTATTACG GTTACGAAAG TGGGCATTTT
GTTGAACAAT TACCCGATAC AGTAGAACAT GATATCAAGC TAGAAACTTT GAGTATTGGT
CTTTATGGCT GGGCAGTAAT TACTTGCCAT AAGACAAAAA ACACCCAATT GATTACCTCA
CCTTGGTGTT CTTCTGAAGA CGTTGCCGAT CTTATCAACC GCTTTACCAG TGCGTGTGAT
GATGTGTTGG TAAAACACGA TTTAAATGAA GCAAGACCTT TTCGTTTACA AGCACCTTTT
ACTTCGAACA TGACAGAAGC CGACTACGCG CAAAAATTTT CTGCTGTTCA AGACTACATT
CAGTCGGGTG ATTGTTATCA GGTCAATTTG GCGCAGCGTT TTTCCACTCG TTACAAAGGC
GACACTTTCA CGGCATATCA CACATTAAGA GACGTTTGCC CGACACCATT TTCGGCTTAT
ATGGAATTTT CGCCAGAGCA AAGTTTGCTA AGTCATTCAC CTGAACGCTT CCTCTTATGT
GATCAAGGCC GAGTAGAATC CAAACCAATC AAAGGCACTG TAGCACGCGG CAAAACACCT
GAAGAAGACA AGGCCAACGC AGATTGGTTA CTTGCATCAA CAAAAGATCG TGCCGAAAAC
CTGATGATTG TCGATTTGCT GCGCAACGAC CTCGGCCGAA CCTGTTTAAC TGGCAGCATC
AAGGTGCCAA AGCTGTTTGC TTTAGAGAGT TATGCCAACG TACATCATTT GGTTTCCACG
GTAGAAGGTA GGATTGATCA AGCAGATCAA GCGATTCGTG TTTTCCATCA AAGCTTCCCT
GGCGGCTCCA TTACTGGCGC ACCTAAGATT CGCTCGATGG AAATTATCGA TGAGCTAGAA
CCTCATGAAC GTTCAGCCTA TTGTGGTTCT ATCGCTTACT TCAGTGCGAA TGGTCAAATG
GATTCCAGCA TTACGATTCG CACATTGGTC GCAGATCATG GCAACTTGCA CTGTTGGGCA
GGCGGTGGAC TGGTCGCGGA TTCGAAATGC CAAGAAGAAT ACCAAGAAAC CTTTACCAAA
GTGGGCAAGT TAACCCATAC TCTAGAACAA GACTTTTTGA AATAG
 
Protein sequence
MSSIKHVNLP YQASLLSFFS AVRDLPYPAL LDSNHEHFPD TNYDILVANP LARIHPVQNN 
QPITWYSKPL YELSNTDNPM TLLNELMSLI CQEPWAKSAP KDLPFVGGLL GYYGYESGHF
VEQLPDTVEH DIKLETLSIG LYGWAVITCH KTKNTQLITS PWCSSEDVAD LINRFTSACD
DVLVKHDLNE ARPFRLQAPF TSNMTEADYA QKFSAVQDYI QSGDCYQVNL AQRFSTRYKG
DTFTAYHTLR DVCPTPFSAY MEFSPEQSLL SHSPERFLLC DQGRVESKPI KGTVARGKTP
EEDKANADWL LASTKDRAEN LMIVDLLRND LGRTCLTGSI KVPKLFALES YANVHHLVST
VEGRIDQADQ AIRVFHQSFP GGSITGAPKI RSMEIIDELE PHERSAYCGS IAYFSANGQM
DSSITIRTLV ADHGNLHCWA GGGLVADSKC QEEYQETFTK VGKLTHTLEQ DFLK