Gene Caci_8801 details

Gene Information

Locus tag: Caci_8801
Symbol:
ID: 8340194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 10204729
End bp: 10205970
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 69%
IMG OID: 644961891
Product: aminotransferase class V
Protein accession: YP_003119455
Protein GI: 256397891
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID:
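
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function name cds_lengths is hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len <= 0 or gene_len % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(10204729, 10205970)
print(gene_len, protein_len)  # 1242 413
```

Both values agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields of the record.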


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.778555
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGGAC ATGATAGCGC TACATCCATC GCCCCCGAAG CGCTACTGGA TGTCGCAGAG 
CTGCGCGCCG AGACGCCCGG CTGCGCGGAG GTCATCCACT TCAACAACGC CGGCTGCGGA
CTGATCGCCG CTCCGGTGCT GCGACCCGTG CTGGAGCATC TCGCTCTCGA AGCGCGGATC
GGCGGCTACG AGGCGGCAGC CCGGCAAGCC GACGCGGTCG CCGACTTCTA CGCCGCGACC
GCTTCGATCA TCGGCGCCGC ACCGCGGAAC ATCGCCTTCG CCAGCAGTGC GACCCACGCC
TTCGCCACCG GCGTGTCCGC GATCCCCTTC GAGCCCGGCG ACGTCATCGT GACCACCCGC
AACGACTTCA TCTCCCAGCA GATCGCGTTC CTGTCACTGC GCAAGCGGTT CGGCGTGCAG
ATCGTCCACG CGCCCGACGC CCCCGAGGGC GGCGTGGACG TGGCAGCGAT GGCCGATCTC
CTGCGCCGGC ACCGTCCCCG GCTCGTCGCG GCGACGCACA TCCCGACCAA CTCCGGACTG
GTCCAGCCGG TCGCCGAGAT CGGCCGGCAC TGCCGTGAAC TGGAGCTGCT GTACCTGGTG
GACGCCTGCC AGTCCGTGGG GCAGGTCCCG GTCGACGTCG AGGCGATCGG CTGCGACCTG
CTCACCGCGA CCTGCCGCAA ATATCTTCGC GGGCCTCGGG GATCGGGCTT CCTGTATCTG
TCCGACCGGG TTCTGTCCGC CGGCTACGAG CCGCTGTTCA TCGACATGTA CGGCTCGCGG
TGGGTCGCGC CAGATGCCTA TCAGCCTGCT GAGACAGCGG CACGCTTCGA GGACTGGGAG
TTCCCCTACG CGACTGTGCT GGGCTGCGCA GCCGCGGCGC GCTATGCCGA GCGCGTAGGC
GTGGAAGCCA GCGGCGCCCG AGCGCTGGCG CTGGCAGCGA ACCTGCGGAC CCGGCTGAGA
GCGATCCCCG GCATCCGCGT CCTGGAGCAG GGTGCGGTAC TCGGCGCCCT CGTCACGTTC
ACGATCCAGG GCTGGCAGCC ACAGCCCTTC AAAGCGGCGA TGGACGCGCG CGGCATCAAC
TCGGCGCTCA GTTTCCGGGA GTTCGCCGTG TTCGACTTCG GCGACAAGGA CGTCGACTGG
TGTCTGCGCC TGTCGCCGCA CTACTACAAC ACCGAGGAGG AAGTCGCTGT CGTCGCGGCG
GCGGTGGCCG AACTCGCCAC CCCTGCGGGG AGCGTCCGGT GA
 
Protein sequence
MNGHDSATSI APEALLDVAE LRAETPGCAE VIHFNNAGCG LIAAPVLRPV LEHLALEARI 
GGYEAAARQA DAVADFYAAT ASIIGAAPRN IAFASSATHA FATGVSAIPF EPGDVIVTTR
NDFISQQIAF LSLRKRFGVQ IVHAPDAPEG GVDVAAMADL LRRHRPRLVA ATHIPTNSGL
VQPVAEIGRH CRELELLYLV DACQSVGQVP VDVEAIGCDL LTATCRKYLR GPRGSGFLYL
SDRVLSAGYE PLFIDMYGSR WVAPDAYQPA ETAARFEDWE FPYATVLGCA AAARYAERVG
VEASGARALA LAANLRTRLR AIPGIRVLEQ GAVLGALVTF TIQGWQPQPF KAAMDARGIN
SALSFREFAV FDFGDKDVDW CLRLSPHYYN TEEEVAVVAA AVAELATPAG SVR
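
The sequences above are displayed in blocks of 10 characters, so they need to be joined before use. The following is a minimal sketch of that cleanup plus a GC-content calculation, not part of the original record: the helper names clean_sequence and gc_percent are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGAATGGAC ATGATAGCGC TACATCCATC GCCCCCGAAG CGCTACTGGA TGTCGCAGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence through clean_sequence and then gc_percent gives a value that can be compared with the GC content field of the record.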