Gene Cmaq_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1045 
SymboltrpD 
ID5710181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1095437 
End bp1096456 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content48% 
IMG OID641275545 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_001540864 
Protein GI159041612 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000000045527 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGCCAC TCCTAGAGAA GATAGCTAGG GGTCTTGAGT TGAGTCTTGA GGAGGCCTAT 
AACGCCGCCT TAGCCATATT AAAGATGGAG GCAGGTGAGG CTGAGACTGC TGCGTTACTC
ATGGGGCTTA GGGTGAGGGG TGAGCGTGCC TTTGAGGTTG CAGGCTTCGC TAAGGCGCTT
AGGGAAACAT GCCTAAGAAT ACCGGTTAAT GACCCATACG TAATAGATAC AGCTGGCACT
GGGGGTGATG GGTTAAGGAC AATGAACGTA TCCACTATTT CAGCATTACT GGCAGCCTAC
CTGGGGGTTA AGGTGCTTAA GCACGGTAAT AGGAGTGTCT CATCATCATC AGGCAGCGCA
GACTTCCTGG AGGCCCTCGG CTTCAACATA AGTGTGAAAC CTGAAACTGC ATTACTAATG
CTGAATAACC ATAGGTTCTC ATTCGCCTTC GCGCCAATGT ACCACCCAGC CATGAAGAAT
GTTATGCCTG TTAGGAGGAG GCTCGGTATA AGAACTATAT TTAACCTAGT GGGTCCATTG
GCTAATCCAG GCCTTGTGAG GAGGCAGGTG CTTGGTGTGG CTGAGGCTGG AATAATGGGG
GTTATGGCTG AGGCAGCTGG TTTAATAGGT TATGACCACC TCCTGTTAGT TCACGGTGAA
CCAGGTATTG ATGAGGTTTC AGTATTCGGG AGAACAATGA TATATGAGGT TAAGGGTAAT
TCCATTGATA AATACGTAAT TGAGCCCCCT GAACTTGGTT TACGCATACA TGAGTTAAGG
GATGTAGTGG TCTCTAACCC AATGGAGAGC ATTGAGAAGG CTAAGAGGGG TTTAATGGGT
GTTGATGAAG CTGCCTTAGA CTTCATAGCC GCCAACACAG CAATGGCGCT CTACGTCGCT
GGTAAGGTTA AGGATCCTAG GGATGGTGTT GAGGCCGTTA AGCAGATTGC AGGTAATTCA
AACGACTTCT GGAGCTACGT TAATAATGTC GCAGCAGTGA GTAGGCGTGA CTCTGCTTAA
 
Protein sequence
MRPLLEKIAR GLELSLEEAY NAALAILKME AGEAETAALL MGLRVRGERA FEVAGFAKAL 
RETCLRIPVN DPYVIDTAGT GGDGLRTMNV STISALLAAY LGVKVLKHGN RSVSSSSGSA
DFLEALGFNI SVKPETALLM LNNHRFSFAF APMYHPAMKN VMPVRRRLGI RTIFNLVGPL
ANPGLVRRQV LGVAEAGIMG VMAEAAGLIG YDHLLLVHGE PGIDEVSVFG RTMIYEVKGN
SIDKYVIEPP ELGLRIHELR DVVVSNPMES IEKAKRGLMG VDEAALDFIA ANTAMALYVA
GKVKDPRDGV EAVKQIAGNS NDFWSYVNNV AAVSRRDSA