Gene Cagg_2587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2587 
Symbol 
ID7267176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3153045 
End bp3154154 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content57% 
IMG OID643567411 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_002463892 
Protein GI219849459 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000154742 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACACCTG TGTTATCACA ACTTATTCGT CCCGAAATTG CCGATCTCGA ACCGTATACT 
CCAATCGTGC CATTGGAAGT ATTGGCCGCT CGGCTCGGCC TACCGGTCGA ACAGATTGTC
AAGCTCGATG CTAACGAAAA TCCGTATGGG CCTGCCCCGG CGGCGCTGGA GGCCATCCAC
CGGACGGCAA CGTACCACAT TTATCCCGAT CCTGAACAAA CGGCGTTGCG CAGCGCCCTC
GCGAGCTATA CCGGTCGCCC AATTGAAGAG ATTCTCTGCG GGGCCGGTGC CGATGAGTTG
ATCGATTTGG TTTTGCGCCT AATCATCAAT CCCGGTGATG CGATTATCGA TTGTCCACCG
ACGTTTGGCA TGTACCGATT TGATGCCGGT ATCTGTGGCG GGCGGGTGAT CAGCGTACCT
CGTCGAGCTG ATTTTAGCCT TGATTTGCCG GCTATCGAAC GCGCGGCAGC GCAGGGAGCC
AAAGCGATCT TTCTCACGGC CCCCAACAAT CCCACCGGCA ATCCGCTCCC CCGCACCGAT
CTCCTGCGTA TCCTCGAACT ACCGATCTTG GTGGTGGTCG ATGAGGCGTA TGTTGAGTTT
GCCGAACCCG ACCATGCGCC GATTGGGGCC AGCGATCTCC TCGATGATTA CCCAAATCTG
GTCATCTTAC GGACGTTCAG CAAATGGGCC GGTCTCGCCG GATTGCGGGT CGGTTACGGT
CTCTTTCCGC GCTGGCTGAG TGAACAGCTC TGGAAGATTA AGCAACCCTA CAATGTATCG
GTAGCCGCTC AAGCGGCGGC GGTCGCATCA CTCAACGCCA TGACCGAGCT GCGCCGGCGC
GTACAAGCGA TCGTCGCCGA ACGCGAACGG TTGTTTGCCC GTTTGAGTGA GGTGGCCTGC
CTGACGCCCT TTCCCAGTGT AGCCAACTTT ATCCTTTGTC GGGTGAACGA CCGTGATGCG
CATGAACTCA AGCTCGCGCT TGAACGGCGG GGGGTGTTGG TACGTCACTA CCGCACACCA
CTGCTCGATG GGTATATTCG GATCAGCGTC GGGACGCCGG ATCAAACCGA TACCTTGCTC
GCAACGATTG CCGAGGTGGT CAATGAGTGA
 
Protein sequence
MTPVLSQLIR PEIADLEPYT PIVPLEVLAA RLGLPVEQIV KLDANENPYG PAPAALEAIH 
RTATYHIYPD PEQTALRSAL ASYTGRPIEE ILCGAGADEL IDLVLRLIIN PGDAIIDCPP
TFGMYRFDAG ICGGRVISVP RRADFSLDLP AIERAAAQGA KAIFLTAPNN PTGNPLPRTD
LLRILELPIL VVVDEAYVEF AEPDHAPIGA SDLLDDYPNL VILRTFSKWA GLAGLRVGYG
LFPRWLSEQL WKIKQPYNVS VAAQAAAVAS LNAMTELRRR VQAIVAERER LFARLSEVAC
LTPFPSVANF ILCRVNDRDA HELKLALERR GVLVRHYRTP LLDGYIRISV GTPDQTDTLL
ATIAEVVNE