Gene Noca_4527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4527 
Symbol 
ID4597046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4788425 
End bp4789738 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content74% 
IMG OID639779138 
Productanthranilate synthase component I/chorismate-binding protein 
Protein accessionYP_925711 
Protein GI119718746 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACC CCGCCCTCGA CCCCGCCCTC GATCCCGCAC GCGACCCGGT GGCGTTCTTC 
CGCGGCGTGG CGGCGGCGTA CCCGCGCTGC TTCTGGCTCG ACGGTGGCGG TGCCCGGGAG
TGGTCGGGTC GTCGGTCGAT GGTCGGCTGG CTGGGCGAGG ACGACGTGTC GCTGACCTAC
TCCGCGGCCC GCCGGGAGGT GCGCCGGCAC GTCGGCGGCA GCTGCGAGGT GGTCGGCGAC
GACGTGTTCG TCGTACTCGA GGCCGAGCTG GCCGCCGGCG CGCCCGATGA CCACTGGGTG
GGCTACCTCG GCTACGCCTG CCGCCCGGAC CTGCCCGCCT CGACCGGTCC CGGCCTGCCC
GACGCCGTGT GGATGCGGCC GGCGGGCGTC CGGTTCTTCG ACCACGGACT GGGCGGGCAA
TCCCGGAACT TCCTGGGGGA AGTTCCGGCC CAACAGTTCC GGTTTCCCCG GTTCACCGGG
GAAACCGACC CGGCCCCGCC CGCCTACGCC ACCGCGTTCG AGGAGGTGCA GGAGCAGCTG
CGGGCGGGGA ACAGCTACGA GGTCAACCTG ACCTACCGGC TGGCGCATCG CAGCGGGGTG
GACCCGGTGA CGGCGTACCT GAGGCTGCGC GAGCTCAACC CGGCGCCGTA CGCCGGGTTC
CTCCAGCACG ATGTCCGCGA CGTGCCGGAC GCCCGGGCCT GGCTGCTCAG CTCCAGCCCG
GAGCGCTACG CGCTGGTGAC CGCCGACCGG AGCATCGAGA CCAAGCCGAT CAAGGGCACC
ACGCTGCGCG GCGCGACCCC CGCCGAGGAC GAGGCCAGCC GGCACCGGCT CGCGACCGAC
TCGAAGTTCC GCGCCGAGAA CCTGATGATC GTCGACCTGC TCCGCAACGA CCTCTCGATG
GTGTGCCGCC CGGGGACCGT GAGCGTGCCG GCGCTGATGG ACGTCGAGTC CTACGCGACC
GTGCACCAGC TGGTCAGCAC CGTCCGCGGC GAGCTGCGCG ACGACGTCAG CACGGTGCAG
GCGCTGCGCG CGCTGTTCCC GGCCGGCTCG ATGACCGGCG CGCCGAAGCT GCGCACCATG
CAGGTGATCG AGCAGGTCGA GGCCACCGAG CGCGGCCCGT ACGCCGGCGC CTTCGGCTGG
GTCTGCGCCG ACGGCCGCGC CGACCTCGGC GTGGTCATCC GCAGCCTCGC CAGCACCGGC
GACGGCGCCT ACCTGCTCGG CACCGGCGGC GGGATCACGG TCCGCTCCGA GGTCGCCGAG
GAGTACGCCG AGTCCCGCTG GAAGGCCGAC CGGCTGCTGG CTGCGCTCGG CTGA
 
Protein sequence
MSDPALDPAL DPARDPVAFF RGVAAAYPRC FWLDGGGARE WSGRRSMVGW LGEDDVSLTY 
SAARREVRRH VGGSCEVVGD DVFVVLEAEL AAGAPDDHWV GYLGYACRPD LPASTGPGLP
DAVWMRPAGV RFFDHGLGGQ SRNFLGEVPA QQFRFPRFTG ETDPAPPAYA TAFEEVQEQL
RAGNSYEVNL TYRLAHRSGV DPVTAYLRLR ELNPAPYAGF LQHDVRDVPD ARAWLLSSSP
ERYALVTADR SIETKPIKGT TLRGATPAED EASRHRLATD SKFRAENLMI VDLLRNDLSM
VCRPGTVSVP ALMDVESYAT VHQLVSTVRG ELRDDVSTVQ ALRALFPAGS MTGAPKLRTM
QVIEQVEATE RGPYAGAFGW VCADGRADLG VVIRSLASTG DGAYLLGTGG GITVRSEVAE
EYAESRWKAD RLLAALG