Gene Caci_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4144 
Symbol 
ID8335498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4683950 
End bp4685179 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content67% 
IMG OID644957247 
Productaminodeoxychorismate lyase 
Protein accessionYP_003114849 
Protein GI256393285 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0208419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGACG AGGAGCCCGG AGGGCTCGGC GGGTTCGCGG ACGAGGTCTC CCGCATGCTG 
CATGCCAGGG CGCTACTACA CCAGGAACAC ACCGACCCGG TCGGGATCAC CAGGAGGAAC
CTGAAGATGG CTAAGCAACG TCGGCGTGCG GTGCTCGGAA CGACCACGAC GCTGGCGGTG
ACCGGCGGTG CGGTGTTCGG CGTCTACGCA CTGTCCGGGC ATGCGCAGGA CAACGGCGGG
ACTCGCAGCG TGGGGCAGAA CGTCGCTGCC GACTACACCG GGGCGGCGAC CTGCCCGGCG
ACCGCGAAGG TCCCGGTGGA CGTCCCGCGC GGCGCGACCA CCACGCAGAT CGCGAACGCG
CTGTTCACCG CCGGTGTGGT GGCCAGTCCG CAGGCGTACG TCGATGCTGC TGACCGGAAT
CAGGGCTCTG TCGGCATCAC CGCCGGAACC TATGCGATCT GCCCGCAGAT CTCCGGCGCC
AACGCGGTGC TGGAGCTGTC GAAGAAGTCG AACCTGTCGG ACGCCTCGCA GATCATCGTG
ACCTCCCACG AGTGGTCGAA GGACGTCATC GCGAGCTTGG TCGACAAGCG GAAGTGGAAG
CAGGCCGACT TCGACGCCGC GATCGCGAGC AACACGATCG GGCTGCCGGC GTGGTCGGTG
GACTCCACGA GCCACAAGTT CACCGCCGAG GGCATGCTGG AGCCGGGGAC GTACTCGATC
ACGTCGTCCG ACACGCCGCA GAGCATCCTG TCGCAGATGG TCGCCAAGCG GATGACGTAT
TTCAAGAGCA TCGACTTCGA GAACAAAGCT GCGAGTCTGG TCTGTGGCGC CGCGAAGTGC
ACGCCGGAGC AGGTGCTGAC GATCGCCTCG ATCGCCGAGG GCGAGGTCGC CGAACCCGGT
GACGGCGCCC GCGTCGCCGA GGGTGTCTAC GCGCGCTTGA AGGCCGGGGA CTATCTCGCC
GTGGACTCCA CGGCGCTGTA CGCCATCGGG CACCTCCCGG CCGGCCAGCT TCCGTCTGCC
AAGCAGGTCC AGGATCCGAA CAACCCGTAC TCGACCTACG CGCCGCACCA CGGTCTGCCG
CCGACGCCGG TCTACATCAC GTCCGACGAC ATGATCAAGT CCGCGCTCGC GCCGACCCAC
GACGGCACCT ATTACTGGTG CGTCACCTCA ACCGGTGCCC GCTTCTTCAC CAAGGGCCAG
GAGACGCAGC GCGATCAGGG CTGCTCGTAA
 
Protein sequence
MRDEEPGGLG GFADEVSRML HARALLHQEH TDPVGITRRN LKMAKQRRRA VLGTTTTLAV 
TGGAVFGVYA LSGHAQDNGG TRSVGQNVAA DYTGAATCPA TAKVPVDVPR GATTTQIANA
LFTAGVVASP QAYVDAADRN QGSVGITAGT YAICPQISGA NAVLELSKKS NLSDASQIIV
TSHEWSKDVI ASLVDKRKWK QADFDAAIAS NTIGLPAWSV DSTSHKFTAE GMLEPGTYSI
TSSDTPQSIL SQMVAKRMTY FKSIDFENKA ASLVCGAAKC TPEQVLTIAS IAEGEVAEPG
DGARVAEGVY ARLKAGDYLA VDSTALYAIG HLPAGQLPSA KQVQDPNNPY STYAPHHGLP
PTPVYITSDD MIKSALAPTH DGTYYWCVTS TGARFFTKGQ ETQRDQGCS