Gene Caul_2777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2777 
SymboltrpD 
ID5900232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3014879 
End bp3015922 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content72% 
IMG OID641563269 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_001684402 
Protein GI167646739 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000134722 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000003256 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGACG CCTTCAAGCC CCTGCTGGCC AAGCTGGCCG ACGGCCAGAC CCTCGACGAG 
GACGACGCCG AGCAGTTCTT CGCCGCCTGC CTGCGCGGCG AGCCGACCCC GGCCCAGGTG
GCCGCGGCGG TCACGGCCAT GCGCCTGCGC GGCGAGACGG TGGGCGAAAT CACCGCCTGC
GCCCGCGCCA TGCGCCGCGC CGCCATCCAC CTGGACCATC CCTATGAGGT GATCGACGTC
TGCGGCACCG GCGGCGACGG CCTGCACACC CTGAACATCT CCACCGCCGT GGGCTTCGTG
GCCGCTGGCG GCGGCCTGAA GGTGGCCAAG CACGGCAACC GGGCGATCAC CAGCAAGTCG
GGGACCGCCG ACGTCCTGGC GGCCCTGGGG GTCAATATCG ACGCCAGCCT GGCCCAGCAG
CGCCACGCGC TGGATACGGC CGGCATCTGC TTCCTGTTCG CCCAGGCCCA CCACGGCGCG
ATGAAGCATG TCTCGCCCAT CCGCCAGCAG CTGGGCTTCC GCACCATCTT CAACCTGCTG
GGTCCGCTGA CCAATCCGGC CGGCGCCAAG CGCCAGGTGG TCGGCGTCTC GGCTCACCGA
TTCGTCGAGC CGGTGGCCAA GGCCCTGGGC GCCCTGGGAG CCGAGCGCGC CTGGTCGGTG
CACGGGGCCG GCATGGACGA ACTGACCACC ACCGGCGAGA CCGAGGTCGC CGAATGGCGC
GACGGCAGCT TGCGCCTGTT CACGATCACT CCCGAAGCCG TCGGCCTGCC GCGCGCCGCC
CTGGCCGACA TCACCGGCGG CGATCCCGCC TATAACGCCG CCGCCCTGAC CGCCCTGCTG
GACGGCCAAA AGGGCGCCTA TCGCGACATC GTCATGCTCA ACGCCGCCGC CGCCTTCCTG
GTGGCCGACA GGGTCGAGAC CCTGCGCGAG GGCGTCGAAC TGGCCGGCGC CGTTCTGGAC
GACGGCCGCG CCAAGGCGGC CCTCGCCGGT CTGGTCGCCG CCACCAACAG TGAAACCGTA
CCCGCCCAAG TGACCCCAGC ATGA
 
Protein sequence
MSDAFKPLLA KLADGQTLDE DDAEQFFAAC LRGEPTPAQV AAAVTAMRLR GETVGEITAC 
ARAMRRAAIH LDHPYEVIDV CGTGGDGLHT LNISTAVGFV AAGGGLKVAK HGNRAITSKS
GTADVLAALG VNIDASLAQQ RHALDTAGIC FLFAQAHHGA MKHVSPIRQQ LGFRTIFNLL
GPLTNPAGAK RQVVGVSAHR FVEPVAKALG ALGAERAWSV HGAGMDELTT TGETEVAEWR
DGSLRLFTIT PEAVGLPRAA LADITGGDPA YNAAALTALL DGQKGAYRDI VMLNAAAAFL
VADRVETLRE GVELAGAVLD DGRAKAALAG LVAATNSETV PAQVTPA