Gene Ndas_3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3137 
Symbol 
ID9246993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3754614 
End bp3755726 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content75% 
IMG OID 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_003681052 
Protein GI297562078 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCC CCGCCGCCCA CTCCGCCGCC GGCTCCGGAG CCGCGAACCC CGACGAACGG 
TCCGCCGTGA CCGAGCGGAC CTGGTCCAAC CTCATCACCG CCCTGCTGGG CGGACGGACC
CTGAGCGCCG ACGACACCGC GTGGGCGATG AACGAGATCA TGTCCGAGGC CGCGACCGAC
GCCCAGATCG CCGGGTTCGC CATCGCGCTG CGCGCCAAGG GCGAGAGCGT GTCGGAGGTG
ACCGGCCTGG CCCGGGGCAT GCTCGACAAC GCCGTGCGCA TCCACGTCCC CGGACCGACC
CTGGACATCG TCGGCACCGG CGGCGACCGC GCGCACACCG TGAACGTGTC CACGATGGCG
GCGATCGTGG CCTCGGCCGC GGGCGCCCGC GTGGTCAAGC ACGGCAACCG GGCGGCGTCC
TCCTCCTGCG GCACCGCCGA CGTGCTGGAG CGCCTGGGCG TGGTCCTGGA CCTGCCGCCG
GAGCTGACCG TGCGGGTGGC CGAGGAGGCG GGCATCGCCT TCTGCTTCGC CCCGGTCTTC
CACCCCTCGT TCCGCTTCAC CGCCAAGCCC CGCCGCGAGC TGGCGGTGCC CACGGTGTTC
AACTTCCTGG GCCCGCTGAC CAACCCGGCC CAGCCCACCA CCTCGGCCAT CGGGGTGTTC
GACGAGCGCA TGTGCGAGGT GATCGCCGGG GTGTTCGCCC GCCGCGGCTC CTCGGCCCTG
GTGTTCCGGG GCGACGACGG GCTGGACGAG CTGACCACCA CGACCACCTC CACGGTGTGG
GTGGTGGACG ACGGCCAGGC CCGGCGCGAG ACGCTCGACC CCGCCGACCT GGGCATCGCC
CGCTCGCGTC CGGAGGACCT GCGCGGCGGC GACGTCGGGT TCAACGCCCA GGCCGTGCGC
GACCTCCTGG CGGGCCGCAC CGGCCCGGTG CGCGACGCCG TGCTGCTCAA CGCGGGCGCC
GCGCTGGCCG CCGTGGACGG CATCCGGGGC CCCCTGCTGG AGTCGGTGCG CGACGGGTAC
GAGCGGGCCG CCGCGGCGGT GGACGGCGGC GCCGCCGAGC GCGCGCTGGA GCGCTGGGTG
GAGATCAGCC AGCAGTACGC CAAGGCCCTG TGA
 
Protein sequence
MNVPAAHSAA GSGAANPDER SAVTERTWSN LITALLGGRT LSADDTAWAM NEIMSEAATD 
AQIAGFAIAL RAKGESVSEV TGLARGMLDN AVRIHVPGPT LDIVGTGGDR AHTVNVSTMA
AIVASAAGAR VVKHGNRAAS SSCGTADVLE RLGVVLDLPP ELTVRVAEEA GIAFCFAPVF
HPSFRFTAKP RRELAVPTVF NFLGPLTNPA QPTTSAIGVF DERMCEVIAG VFARRGSSAL
VFRGDDGLDE LTTTTTSTVW VVDDGQARRE TLDPADLGIA RSRPEDLRGG DVGFNAQAVR
DLLAGRTGPV RDAVLLNAGA ALAAVDGIRG PLLESVRDGY ERAAAAVDGG AAERALERWV
EISQQYAKAL