Gene MCA2584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2584 
SymboltrpE 
ID3104398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2767812 
End bp2769299 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content67% 
IMG OID637171720 
Productanthranilate synthase component I 
Protein accessionYP_114990 
Protein GI53803276 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCCG AACGATTCCA GGCCTTCGCC GCCCAAGGCT ACAACCGCGT CCCGCTGGCG 
CGGCGCGTCC TCGCCGACCT CGACACCCCT TTGAGCGCCT ATCTCAAGCT CGCCGATGGG
CCGTATTCCT ACCTGTTCGA ATCCGTCCAC GGCGGCGAGC AGTGGGGCCG CTATTCCATC
ATCGGCCTGC CCTGCCGGAC CTGCATCGAA GTCCGCGGCC ATGAGGTCAT CGTGCTGAGG
GATGGCGCCC GCGCCGAGAC CCTGACGGTC GAAAACCCGC TGGCATGGAT CAAGGCTTTC
GGCAGCCGTT TCAAGGTCCC GGACCTCGAA GGTCTGCCGC GCTTCACCGG GGGCCTGGTC
GGCTATTTCG GCTACGAGAC CATGGGCTAC ATCGAGCCGC GCCTGGCCAA GACCAAACCG
GACCCGATCG GCTCGCCCGA TATCCTGTTG ATGGTGTCGG AAGAAGTGCT GGTGTTCGAC
AAACTCACCG GCAAGCTGCT GGTCGTCGTC CACGCCGATC CCAACGAAGC CGGCGCCTAC
GCCAAGGCCC AGACCCGGCT GGACGAGCTG GTGCGTGAAC TGCGCAGCCG CCAGCTCCCG
CCCGCGCCGC CGCGCTCACC GCGCACGGTG GACGAGGCCG ATTTCATCTC CGGCTTCACG
CGGGAAGGCT TCGAGGACGC GGTGCGGCGG GTCAAGGAGT ACATCGTCGA GGGCGACGTG
ATGCAGGTGG TGCTGTCGCA GCGGCTGAGC ATTCCCTACG CCGCCTCGCC CCTGGACCTC
TACCGCGCCC TGCGCTGCCT CAACCCCTCG CCTTACATGT ACCAGCTCAA CCTGGGGGAT
TTCCACGTGG TCGGCTCGTC GCCGGAAATC CTGGTGCGGC TGGAGGACGG CACCGTGACG
GTCCGCCCCA TCGCCGGCAC CCGCCGCCGC GGCCGCAGCC CCGAAGAGGA TCAGGCGCTG
GAGCGGGAGC TCCTGGCCGA CCCCAAGGAA CTCGCCGAGC ATCTGATGCT GATCGACCTG
GGCCGCAACG ACACCGGCCG GATTTCCGAG ACCGGCAGCG TGCGGCTCAC CGAGAAGATG
GTGGTGGAGC GCTATTCCCA CGTCATGCAC ATCGTCTCCA ACGTGACCGG CAAACTCCAG
GCCGGCAAGG ACGCCTACGA CGTACTGGCG GCGACCTTCC CCGCCGGCAC CGTCAGCGGG
GCACCGAAGA TCCGGGCCAT GGAGATCATC GCCGAGCTGG AGCCGGTGAA ACGTGGGGTC
TATTCCGGCG CGGTGGGCTA CATCGGCTGG TCCGGCAACA TGGACACGGC GATCGCCATC
CGCACCGCCG TCATCAAGGA CGGGCGGCTC CACATCCAGG CCGGCGCCGG CGTCGTCTAC
GACTCGGTAC CGCGCAGCGA GTGGGAGGAA ACGATGAACA AGGCACGGGC CATTTTCCGG
GCGGTCGCCA TGGCCGAAGC CGGCGTCGAA GGAGGCGAGA ACGCATGA
 
Protein sequence
MTPERFQAFA AQGYNRVPLA RRVLADLDTP LSAYLKLADG PYSYLFESVH GGEQWGRYSI 
IGLPCRTCIE VRGHEVIVLR DGARAETLTV ENPLAWIKAF GSRFKVPDLE GLPRFTGGLV
GYFGYETMGY IEPRLAKTKP DPIGSPDILL MVSEEVLVFD KLTGKLLVVV HADPNEAGAY
AKAQTRLDEL VRELRSRQLP PAPPRSPRTV DEADFISGFT REGFEDAVRR VKEYIVEGDV
MQVVLSQRLS IPYAASPLDL YRALRCLNPS PYMYQLNLGD FHVVGSSPEI LVRLEDGTVT
VRPIAGTRRR GRSPEEDQAL ERELLADPKE LAEHLMLIDL GRNDTGRISE TGSVRLTEKM
VVERYSHVMH IVSNVTGKLQ AGKDAYDVLA ATFPAGTVSG APKIRAMEII AELEPVKRGV
YSGAVGYIGW SGNMDTAIAI RTAVIKDGRL HIQAGAGVVY DSVPRSEWEE TMNKARAIFR
AVAMAEAGVE GGENA