Gene M446_5398 details

Gene Information

Locus tag: M446_5398
Symbol:
ID: 6132448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5930807
End bp: 5932321
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 74%
IMG OID: 641645532
Product: anthranilate synthase component I
Protein accession: YP_001772148
Protein GI: 170743493
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
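
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5930807, 5932321)
print(length_bp)                  # 1515
print(protein_length(length_bp))  # 504
```

Under those assumptions the computed values agree with the listed gene length (1515 bp) and protein length (504 aa).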


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.124058
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGTCA CGCCCCCGCT CGACGCCGCG CAAGCCGCCC TCGCGGCCGG CACGCCCGTG 
CTCCTGCGCG CCACGCTCGT CGGCGACCTG GAGACCCCGG TCGCGGCCTT CCTCAAGCTG
AGGGCGGGGC GCGAGGGCGC GGCCTTCCTG CTCGAATCCG TCGAGGGCGG CGCCGTGCGC
GGGCGCTACT CGATGATCGG CCTCGACCCC GACCTCGTCT GGCGCTGCGG CGGCGGCCGG
GCCGAGCGGG CCGACGCGCC CGCCCTCGAC CGCTTCGTCC CCGACGACCG CCCGCCGCTC
GAGAGCCTGC GCGCCCTCAT CGCCGAGTCC GCCCTGCCGC GGGACCCCGC CCTGCCGCCG
ATGGCCGCGG GCCTGTTCGG CTATCTCGGC TACGACATGG TGCGGGAGAT GGAGCGGCTC
GCCCCGCCGA AGCCCGACCC GATCGGCGTG CCGGACGCCA TCCTGGTCCG CCCGACCGTG
ATGGTGGTGT TCGACGCCGT GCGCGACGAG ATCGCGGTGG TCACCCCGGT CCGCCCCGCG
GCGGGCGTCG CGCCCCGCGC CGCCTGCGAG GCCGCCCTCG CCCGGCTGGA GGCGGTCGCC
GAGGCGCTCG AAGGGCCGCT CCCCGTCGAG GCCCGCGCCA ACCCGGCCGA GATCCCGGCC
CCCTCCCCGG TCTCGAACAC CGCGCCGGAG GCGTTCCACG CCATGGTGGC GCGGGCCAAG
GAGTACATCG CGGCCGGGGA CATCTTCCAG GTCGTGCTCT CGCAGCGCTT CGAGGCGCCC
TTCGCGCTGC CGGCCTTCGC GCTCTACCGC GCGCTGCGCC GGGTGAACCC GGCCCCCTTC
CTGTGCTACC TCGATTTCGG CGCCTTCCAG ATCGTCTGCT CCTCGCCCGA GATCCTGGTG
CGGGTGCGCG ACGGCAAGGT CACGATCCGC CCGATCGCCG GCACCCGCCG CCGCGGCGCC
ACGCCCGAGG AGGATCGGGC GCTCGCCGAG GACCTCCTGG CCGACCCCAA GGAGCGGGCC
GAGCACCTGA TGCTCCTCGA TCTCGGCCGC AACGACGTCG GGCGGGTGGC CGAGATCGGC
AGCGTGTCGG TCACCGAGTC GTTCTTCCTG GAATATTACA GCCAGGTGAT GCACATCGTC
TCGAACGTGG AGGGCCGGCT CGACCCGCGC CACGACGCGC TCGGCGCCCT GGTGGCGGGT
TTCCCGGCCG GCACCGTCTC GGGCGCCCCG AAGGTGCGGG CGATGCAGAT CATCGACGAG
CTGGAGCGCG AGAAGCGCGG TCCCTACGCG GGCTGCATCG GCTATTTCGG CGCGGACGGG
CAGATGGACA CCTGCATCGT CCTGCGCACG GCCGTGGTGA AGGACGGCCG CATGCACGTC
CAGGCGGGCG CCGGGATCGT GCACGATTCC GATCCGGCCT CCGAGCAGCA GGAATGCGTC
AACAAGGCGA AGGCCCAGTT CCGGGCCGCC GAGGAGGCCG TGCGCTTCGC CGCCCAGGCG
CGGCGGGGGC AGTGA
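
The gene sequence is printed in space-separated blocks of 10 bases, so whitespace has to be stripped before computing anything from it. A minimal sketch of that cleanup plus a GC-percentage calculation follows; the helper names are hypothetical, and how the database rounds its own GC content value is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed row of the gene sequence; paste all rows to
# evaluate the whole gene.
first_row = "ATGCTCGTCA CGCCCCCGCT CGACGCCGCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```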
 
Protein sequence
MLVTPPLDAA QAALAAGTPV LLRATLVGDL ETPVAAFLKL RAGREGAAFL LESVEGGAVR 
GRYSMIGLDP DLVWRCGGGR AERADAPALD RFVPDDRPPL ESLRALIAES ALPRDPALPP
MAAGLFGYLG YDMVREMERL APPKPDPIGV PDAILVRPTV MVVFDAVRDE IAVVTPVRPA
AGVAPRAACE AALARLEAVA EALEGPLPVE ARANPAEIPA PSPVSNTAPE AFHAMVARAK
EYIAAGDIFQ VVLSQRFEAP FALPAFALYR ALRRVNPAPF LCYLDFGAFQ IVCSSPEILV
RVRDGKVTIR PIAGTRRRGA TPEEDRALAE DLLADPKERA EHLMLLDLGR NDVGRVAEIG
SVSVTESFFL EYYSQVMHIV SNVEGRLDPR HDALGALVAG FPAGTVSGAP KVRAMQIIDE
LEREKRGPYA GCIGYFGADG QMDTCIVLRT AVVKDGRMHV QAGAGIVHDS DPASEQQECV
NKAKAQFRAA EEAVRFAAQA RRGQ
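
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below builds the codon map from the NCBI one-line amino acid string, whose sense-codon and stop assignments are the same for tables 1 and 11. It is a simplified illustration: it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here), and the names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCTCGTCA CGCCCCCGCT CGACGCCGCG"))  # MLVTPPLDAA
print(translate("CGGCGGGGGC AGTGA"))                  # RRGQ
```

The two calls use the first 30 and last 15 bases of the gene sequence and reproduce the first ten and last four residues of the protein sequence.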