Gene Cag_0798 details


Gene Information

Locus tag: Cag_0798
Symbol:
ID: 3747452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1119712
End bp: 1120875
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 48%
IMG OID: 637773328
Product: aspartate aminotransferase
Protein accession: YP_379107
Protein GI: 78188769
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
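
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions that happen to be consistent with the numbers in this record, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1119712
end_bp = 1120875
gene_length_bp = 1164
protein_length_aa = 387

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.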


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTGC ACCTTAGTAA TCGCCACGCC TCAGTGCTGC AATCCGAAAT TCGTAGCATG 
TCCATAGCAT GTAGTCGTGT TAACGGCATT AATCTCGCTC AAGGTGTTTG CGATACTCCC
GTGCCCAACG AGGTGCTGCA AGGTGCAAGC GAAGCTCTTC AGCAAGGGGT GAACACTTAT
ACCCATTATG CAGGAATTAT TAGTTTACGC GAAGCTATTG CCGATAAGCA AGAACGTTTT
TATGGTATTC GCTACCAGCC TGAATCAGAA ATTATTGTAA GTGCCGGTGC AACTGGTGCG
CTATACGCAG CTTTTCAAGC ATTACTGAAT CCGGGCGATG AGGTAATTTT ATTTGAACCC
TTTTATGGTT ACCACATAAC CACCTTGCAA GCGGCGGAAG CGGTGCCTCT CTATCTACCG
TTAACGCTGC CGGAATGGAG CTTTAGTGAG CACGATCTTG AACAACTTGT TACACCACGT
ACACGAGCCA TTATTGTTAA TACGCCTGCA AATCCCTCAG GAAAAGTCTT TTCATTAGCA
GAAATGGAGC GTATTGCAGC CTTTGCCGAG CGTTACGACC TATTTGTTTT TACGGATGAA
ATTTATGAAC ACTTTCTCTA CGAAGGGCAT CAACACCATA GTTTTGCCGC ATTGCCCGGC
ATGAAAGAGC GCACCATAAC GGTGTCGGGG GCTTCAAAAA CCTTTAGCGT TACGGGATGG
CGTATTGGAT ATGCCTTGTG CGACGCGCGT TGGGCGCAAG CTATCGGTTA CTTTAATGAC
CTTGTCTATG TTTGTGCGCC AGCACCATTG CAAGCAGGGG TTGCGCGTGG TATGAGAGAA
CTTGATGATC GTTTTTACAA CCATCTGTCG GTTGATTATC AAGCAAAGCG CGATCGCTTT
TGTGCAACTT TAGCAAAAGC AGGGCTTGTT CCACACATTC CCGATGGTGC CTATTATGTG
TTAGCCGACG TTTCAGCATT ACCCGGCAAT AGTGCTCACG AGCGAGCCAT GCACATTCTT
AATCGCACAG GCGTGGCAAG CGTCCCGGGC AGCGCATTTT ATCAACATGG TAGAGGCGAT
GGGTTAGTTC GTTTTTGCTA CGCCAAAGAG GATGCAATTT TAGAAGAGGC GTACCAACGT
CTTGAGCGGT TGAGAGAGGG GTAA
 
Protein sequence
MSLHLSNRHA SVLQSEIRSM SIACSRVNGI NLAQGVCDTP VPNEVLQGAS EALQQGVNTY 
THYAGIISLR EAIADKQERF YGIRYQPESE IIVSAGATGA LYAAFQALLN PGDEVILFEP
FYGYHITTLQ AAEAVPLYLP LTLPEWSFSE HDLEQLVTPR TRAIIVNTPA NPSGKVFSLA
EMERIAAFAE RYDLFVFTDE IYEHFLYEGH QHHSFAALPG MKERTITVSG ASKTFSVTGW
RIGYALCDAR WAQAIGYFND LVYVCAPAPL QAGVARGMRE LDDRFYNHLS VDYQAKRDRF
CATLAKAGLV PHIPDGAYYV LADVSALPGN SAHERAMHIL NRTGVASVPG SAFYQHGRGD
GLVRFCYAKE DAILEEAYQR LERLREG
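
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch, not the tool that produced the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are allowed, so alternative start codons are not handled here. The names CODON_TABLE, clean, translate and gc_content are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence above.
first_row = "ATGAGCTTGC ACCTTAGTAA TCGCCACGCC TCAGTGCTGC AATCCGAAAT TCGTAGCATG"
last_row = "CTTGAGCGGT TGAGAGAGGG GTAA"

print(translate(first_row))  # MSLHLSNRHASVLQSEIRSM
print(translate(last_row))   # LERLREG
```

The two printed fragments match the first 20 and the last 7 residues of the protein sequence above. Passing the complete gene sequence to gc_content gives a value that can be compared with the GC content field.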