Gene Cag_1086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1086 
Symbol 
ID3747953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1467990 
End bp1469081 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content46% 
IMG OID637773617 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_379391 
Protein GI78189053 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.742726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGC CTTATTCTAT AGAGCATTTA CTCAATCCCG CATTGCGCAA CATTGCTACC 
TACAAGGTGG AGGGTGGTCA GCAAGCTGAA ATTAAGCTGA ACCAAAATGA AAGTCCCTTT
GATGTGCCGC AATGGCTTAA GGAGGAAATT ATTGGTGAGT TTATTCGGGA GCCATGGAAT
CGCTACCCCG ATATTCTCCC TTATCGCGCC ATGGAGGCGT ATGCTAATTT TGTGGGGGTA
CCTGCTGAGT GTGTGATAAT GAGCAATGGC TCCAATGAAA TGCTCTACAC TATTTTTCTT
GCCTGCTTAG GACCAGGTAG AAAAGTGCTT ATTCCCAATC CCTCTTTTTC GCTCTATGAA
AAACTTGCCC TGCTCTTGCA GTCGGATATT GTAGAGGTAC CTATGAAGAG TGATCTTTCG
TTTGATGTTG AGGCTATTAT GAAGGCGGCG CACAATGAGG CGGTGGATGT GATTGTGCTC
TCGAATCCCA ACAATCCCAC CTCCACCTCA ATGAGTTACG ATGCAGTCCG TAAGATTGCG
GAGTCCACGC AAGCGTTGGT GTTAGTTGAT GAGGCGTATA TTGAGTTTTC GCGTGAGCGT
TCAATGGTGG ATACTATTGA AGAATTACCT AATGTAGTGG TGTTGCGTAC CATGTCGAAA
GCGCTTGCGC TTGCGGGTAT TCGTATTGGT TTTGCTCTTG CAAATGCGCC GTTGATGGCT
GAAATTTCTA AACCAAAAAT TCCTTTTGCC TCAAGCCGTC TTGCTGAAAT TACCTTAATG
AAGGTGCTTG CAAATTATCG TTTAGTAGAT GAAGCGGTTT CGGCTATTTT AAGCGAGCGC
GATGCCTTGT ATGAGCAGTT GCGCATGATG GAGGGCGTTT CGCCGTTTGC CAGCGACACG
AACTTTTTAA TTGTGCGAGT AGCCGATGCT AACGCTACCT TTAAGCGCCT TTACGATAAG
GGAATTTTGG TACGCAATGT GTCGGGCTAT CACTTAATGG AGGGGTGTTT GCGCTGCAAT
GTTGGTTTGC CTGAAGAGAA TCGCCGTTTA GCCGAGGCGT TTGCTGAGCT TTCAGTGGAA
GTGAAAGGAT GA
 
Protein sequence
MNTPYSIEHL LNPALRNIAT YKVEGGQQAE IKLNQNESPF DVPQWLKEEI IGEFIREPWN 
RYPDILPYRA MEAYANFVGV PAECVIMSNG SNEMLYTIFL ACLGPGRKVL IPNPSFSLYE
KLALLLQSDI VEVPMKSDLS FDVEAIMKAA HNEAVDVIVL SNPNNPTSTS MSYDAVRKIA
ESTQALVLVD EAYIEFSRER SMVDTIEELP NVVVLRTMSK ALALAGIRIG FALANAPLMA
EISKPKIPFA SSRLAEITLM KVLANYRLVD EAVSAILSER DALYEQLRMM EGVSPFASDT
NFLIVRVADA NATFKRLYDK GILVRNVSGY HLMEGCLRCN VGLPEENRRL AEAFAELSVE
VKG