Gene Caci_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1946 
Symbol 
ID8333289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2201182 
End bp2202951 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content67% 
IMG OID644955095 
Productaspartate aminotransferase 
Protein accessionYP_003112707 
Protein GI256391143 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.346248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0292973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCCA CCACTCCGGA GAAGAAGAGC GCCGCGCGCG CCAAGGCTGG CAGCGCCAGG 
AGCGCCAAGA GCGCTAGGGG CGCCACCGCC AAGAACAACA CCGCCGCTCC GCGCAAGCGT
GTCCCCTCCA CGGTCACGTC GGCGGGCCTG TCCCGCGGCA AGATCAAGGA GTGGTCCGCC
CTGAGTCCGT TCGAGCTCAA GGGTGAGCTG ATCGCGCTGG CGGCCGACAC GCACAAGAAG
TCGGCCGCGC AGATGCTCAA CGCCGGGCGC GGGAACCCGA ACTGGATCGC GACCGGGCCG
CGCGAGGCGT ATCTGGCGCT GGGTGCCTTC GCGCTGCAGG AGTCGCGGCG GGTCTGGACC
ATGGACAACC TCGGCGGGAT GCCGGAGGTG GCCGGGAGCG GCGTGCGCTT CGACCGCTTC
TGCCTGGCGA ATCCGGGGAT GACCGGGGTG CGGCTGCTGC GCGACATGGT GGACTACGGC
GTCACCCAGC TGAAGTTCGA CGCCGACGCG TGGATCGGCG AGCTCACCGA CGCCTGGATC
GGCGACCACT ACCCGGACCC GCCGCGGGCG CTCAAGCACT GCCAGAAGGT CGTGCGCGCC
TATCTGGCCG AGGAGATGTA CGGCGGAAAG ACGCCCGGCA AGGTCGACGT CTTCCCGACC
GAGGGCGGCA CGGCGGCGAT GTGCTACCTG TTCGACACGC TGGTCACCAA CGGCATCCTG
CACCCCGGCG ACACCATCGC CCTGATGACG CCGATCTTCA CGCCCTACAT CGAGATCCCC
GAGCTGGAGC GTTACTCCTT CAACGTAATC CAGGTGAAGT CGGACATGAT GACCGAGGAG
GGCGTCCACA TGTGGCGCTA CCCCGACTCC GAGGTCGACC GGCTGGCGGA CCCGAAGGTC
AAGGCCGTGA TGCTGGTGAA CCCCTCGAAC CCGCCGTCGA TGGCGATGTC CGACCGGGTC
CGCGACCGCA TCGCCGACAT CATCCGGACC AAGAACCCGA ACCTGGCGAT CATCACCGAC
GACGTCTACG GCACGTTCGT CCCCGGCTTC CGCTCCCTGG CCGCGACCTG CCCGCGCAAC
ACGGCGCTGG TCTACTCCTG GTCCAAGCAC TACGGCGCGA CCGGCCACCG CCTCGGCGTG
ATCGCGGTGG CCGAGGACAA CGTCTTCGAC CAGATGCTCG CCAAGCTCCC GAAGGCGAAG
AAGGACGAGC TGCGCCGGCG TTACAGCACC CTGACTCTGC ACCCTGAGAA GACGAAGTTC
ATCGACCGCC TGGTCGCCGA CAGCCGCGCG GTCGCGTTGA ACCACACCGC GGGCCTGTCC
ACCCCGCAGC AGACGATGAT GATGCTGTTC TCCCTGTTCG ACCTGCTGCC GGAGGGCCAG
GAGTACAAGG AGCTGCTGCG CACGATCGTC CACCGCCGCC TGGACCTGCT GATGGAGGGC
ATCGGCGTCC ACCACATCAG CGACGACCCG GACCGCGCCT GCTACTACGT CGAGCTGGAC
ATCCTCGCCG AGGCCGAGGC CTTCGAGAGC CGGGAGTTCG CCGACTTCCT GATGGAGACC
TACGAACCCA CCGACGTGGT GTTCCGCCTG GCGCATCAGG CCTCTGTCGT GCTGCTGAAC
GGCGGCGGCT TCGACGGACC GGGATGGTCG GTGCGGGTGT CGCTGGCGAA CCTGGACGAC
TTGGACTACC TGAAGATCGG GCATCACCTG CACGCGATCA TGGAGGAGTA CAAGGAAGAG
TGGCTGAAGA CCAAGACCAA GAAGAAGTGA
 
Protein sequence
MTATTPEKKS AARAKAGSAR SAKSARGATA KNNTAAPRKR VPSTVTSAGL SRGKIKEWSA 
LSPFELKGEL IALAADTHKK SAAQMLNAGR GNPNWIATGP REAYLALGAF ALQESRRVWT
MDNLGGMPEV AGSGVRFDRF CLANPGMTGV RLLRDMVDYG VTQLKFDADA WIGELTDAWI
GDHYPDPPRA LKHCQKVVRA YLAEEMYGGK TPGKVDVFPT EGGTAAMCYL FDTLVTNGIL
HPGDTIALMT PIFTPYIEIP ELERYSFNVI QVKSDMMTEE GVHMWRYPDS EVDRLADPKV
KAVMLVNPSN PPSMAMSDRV RDRIADIIRT KNPNLAIITD DVYGTFVPGF RSLAATCPRN
TALVYSWSKH YGATGHRLGV IAVAEDNVFD QMLAKLPKAK KDELRRRYST LTLHPEKTKF
IDRLVADSRA VALNHTAGLS TPQQTMMMLF SLFDLLPEGQ EYKELLRTIV HRRLDLLMEG
IGVHHISDDP DRACYYVELD ILAEAEAFES REFADFLMET YEPTDVVFRL AHQASVVLLN
GGGFDGPGWS VRVSLANLDD LDYLKIGHHL HAIMEEYKEE WLKTKTKKK