Gene Namu_3078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3078 
Symbol 
ID8448692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3394258 
End bp3395865 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content74% 
IMG OID645042160 
Productanthranilate synthase component I 
Protein accessionYP_003202401 
Protein GI258653245 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00101032 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000123729 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGGCA TGTCCGTCAG CACCCCCGCC CCGCCGCGCG CGGCCCAGCC CGGGCTCGGG 
GAGATCAGCC CCAGCCGGGA GGAGTTCCGC GAGCTCGCCC GGGATCGGCG GGTGATCCCG
GTGACCCGGC GCCTGCTCGC CGACACCATC ACCCCGGTCA GCCTGTACGC GACCCTGGCC
GGCGACCGGC CCGGCACCTT CCTGCTGGAA TCGGCCGAGA ACGGGCGGTC CTGGTCGCGC
TGGTCGTTCG TCGGGGTGTC CGCCCCGGCC GTGCTGACCG AACGCGACGG GCAGGCGACC
TGGCTGGGCA CCCCGCCGGC CGGCCTGCCG ACCAGCGGCG ACCCGCTGCG GGTCCTGGAC
GAGTCGCTGC GCTTCCTGCA CACCGAGCCG CTGGCCGGGC TGCCCCCGCT GACCGGCGGC
CTGGTCGGCT ACCTCGGCTA CGACGTGGTC CGCCGCTGGG AGAAGATCGA CACCGCGGCC
AGCGCGACCC GCCCGCCGGC GCCGGCCGAC CCGGAGATTC CCGAGCTGGT CATGCTGCTG
GCCACCGACC TGGCCGCCCT GGACCACCAC GCCGGCACCG TCACCCTGAT CGCCAACGCG
GTGAACTGGG ACGGCACCGA CGCCCGGGTC GACCAGACCT ATGACCACGC GGTGGCCCGG
CTGCACGAGA TGAGCCGCAC GCTGGCCCAG CCGCGGTCGC TGCCGGCCGC CCACTTCACC
GCCCGCACCC CGCCGGTGCG CCGGCGCACC GAGTCCGCCG AGTACCAAGC CAACGTGGAC
GTGGCCAAGG AGCACATCCG GGCCGGGGAC GCCTTCCAGA TCGTGCTGTC GCAACGGTTC
GACGTGCCCA CCGAGGCCGA CCCGCTGGAC ATCTACCGGG TGCTGCGGGC CACCAACCCG
AGCCCGTACA TGTACCTGCT GCGCCTGCCC ACCCCCGACG GCGGCTCCTT CTCGGTGGTC
GGCTCCTCGC CCGAGGCGCT GGTCACCGTG CGCGAGGGCC TGGTGACGAT GCACCCGATC
GCCGGCACCC GGCCCCGCGG GCACACCGAG GAGGACGACG TCTGGCTGGC CAAGGACCTG
TTGGCCGACG AGAAGGAACG CAGCGAGCAC GTGATGCTGG TCGACCTGGG CCGCAACGAC
CTGGGCCGGG TCTGCGCCCC GGGCACGGTC AAGGTGGTCG ACTTCTTCAC CATCGAGCGG
TACAGCCACG TCATGCACAT CGTCTCGACG GTCACCGGGC AGCTGGCCGC CGACCGCACC
GCCTACGACG CACTGGCCGC CTGCTTCCCC GCGGGCACCC TGTCCGGGGC GCCCAAGCCG
CGGGCCATGC AGATCATCAA CGAGCTCGAA CCGCTGCGCC GCGGCGTGTA CGGGGGAGTC
GTGGGCTACC TGGACTTCGC CGGGGACGCC GACACCGCGA TCACCATCCG TACCGCGCTG
GTGGTCGACG GCACCGCCTA CGTGCAGGCC GGCGCCGGGG TGGTGGCCGA CTCGGTACCC
GAGAACGAGG ACGCGGAGTG CCGGAACAAG GCCGCCGCCG TCATCGCCGC CGTCGGCGCC
GCCGCGACCA TGCAGGTGGT CGGGGCCACG CAGGTGATCG GTGACTGA
 
Protein sequence
MTGMSVSTPA PPRAAQPGLG EISPSREEFR ELARDRRVIP VTRRLLADTI TPVSLYATLA 
GDRPGTFLLE SAENGRSWSR WSFVGVSAPA VLTERDGQAT WLGTPPAGLP TSGDPLRVLD
ESLRFLHTEP LAGLPPLTGG LVGYLGYDVV RRWEKIDTAA SATRPPAPAD PEIPELVMLL
ATDLAALDHH AGTVTLIANA VNWDGTDARV DQTYDHAVAR LHEMSRTLAQ PRSLPAAHFT
ARTPPVRRRT ESAEYQANVD VAKEHIRAGD AFQIVLSQRF DVPTEADPLD IYRVLRATNP
SPYMYLLRLP TPDGGSFSVV GSSPEALVTV REGLVTMHPI AGTRPRGHTE EDDVWLAKDL
LADEKERSEH VMLVDLGRND LGRVCAPGTV KVVDFFTIER YSHVMHIVST VTGQLAADRT
AYDALAACFP AGTLSGAPKP RAMQIINELE PLRRGVYGGV VGYLDFAGDA DTAITIRTAL
VVDGTAYVQA GAGVVADSVP ENEDAECRNK AAAVIAAVGA AATMQVVGAT QVIGD