Gene Namu_4017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4017 
Symbol 
ID8449636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4433119 
End bp4434234 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content75% 
IMG OID645043062 
ProductChorismate binding-like protein 
Protein accessionYP_003203298 
Protein GI258654142 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.3825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.222242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCTA CCCCGCCCTT CGCGCACGTC GGCGGCCTGC TGGCGACCGA GCTGGAGCAG 
GTGGCGGACC TGGCCGCCGA TCCGTCGGTG CTGGACCGCG GTTGGTGGGT GGTGGTCGGC
ACGTTCGAGG GCGAGCTGAC CGGGTACCGG TTCGGCCGGG TCCGCCCGGC CGAACTGCCG
GCGCCGACCG GTCGCTGGCC GGGTCCGGCG GCCGACCAGT GGCACTCCAG CCTGGACCGG
GCCGCCTACC AGGACGGGGT CCGCCGCATC CGCGCCTTCA TCGCCGCCGG CGATGTCTAC
CAGGTCAATC TCTGTCGGCT GGTGCGGGCG CCGTTGGCCG CCGACGCGGA CCCGCTGGTG
CTGGCCCACC GGCTGGCCGC CGGCAACCCG GCCCCCTGGT CCGGGCTGCT GCACCTGGGC
TCGTCCTGGA TCGTCAGCGC CTCCCCCGAG CTGTTCCTGG CCCGGGACGG CGACCGGCTC
GCCTCGTCGC CGATCAAGGG CACCACCCGT CCGGGGGAAC CGTTCGCGGA CAAGGACTTC
CCCGAGAACA TCATGATCAC CGACCTGGTC CGCAACGACC TGGGCCGGAT CGCCCGGCCG
GGTTCGGTGG TGGTCACCGA ACTGCTGGCC CGGCAGGAGC ATCCCGGATT GGCCCACCTG
GTCTCGACGG TGACCGCCCG GCTGGCCGAC GGCGTCGGCT GGGGGCAGAT CCTGCCGGCC
ACCTTCCCGC CCGGGTCGGT GACGGGCGCG CCGAAGATCC GCGCGCTGGA GGTGATCGCC
GAACTGGAGC CCGTGCCCCG GCAGGTCTAC TGCGGCGCCT TCGGCTTCGT CGACGGCGAG
CACCGGCGGG CCCGGCTGGC GGTGGCCATC CGGACGTTCT TCGCCACCAC CGACGCGGCC
CCCGGTCACC GGCCGGTCGC GGGGTCCGGG ACGCTGCACT TCGGCACCGG CGCCGGCATC
ACCTGGGCCT CGGACCCGGC GGCCGAATGG GCCGAGACCG AGCTCAAGGC GGCCCGGTTG
ATCGGGCTCA CCGGCGCGGC GCCGACCACC CCGGATCCCG GCCGGTCAGC CCCAGCACCT
GATCCAGCGC CGACGCGGTC GGCGGCACCG CGATGA
 
Protein sequence
MTPTPPFAHV GGLLATELEQ VADLAADPSV LDRGWWVVVG TFEGELTGYR FGRVRPAELP 
APTGRWPGPA ADQWHSSLDR AAYQDGVRRI RAFIAAGDVY QVNLCRLVRA PLAADADPLV
LAHRLAAGNP APWSGLLHLG SSWIVSASPE LFLARDGDRL ASSPIKGTTR PGEPFADKDF
PENIMITDLV RNDLGRIARP GSVVVTELLA RQEHPGLAHL VSTVTARLAD GVGWGQILPA
TFPPGSVTGA PKIRALEVIA ELEPVPRQVY CGAFGFVDGE HRRARLAVAI RTFFATTDAA
PGHRPVAGSG TLHFGTGAGI TWASDPAAEW AETELKAARL IGLTGAAPTT PDPGRSAPAP
DPAPTRSAAP R