Gene Acid345_4120 details

Gene Information

Locus tag: Acid345_4120
Symbol:
ID: 4072311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4879933
End bp: 4881402
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 63%
IMG OID: 637986151
Product: anthranilate synthase, component I
Protein accession: YP_593194
Protein GI: 94971146
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
            [TIGR00565] anthranilate synthase component I, proteobacterial subset
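
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4879933
END_BP = 4881402
GENE_LENGTH_BP = 1470
PROTEIN_LENGTH_AA = 489


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the span length agrees with the listed gene length, and the codon count minus the stop codon agrees with the listed protein length.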


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.622363
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTCGC CAGATTTCAA GTCCTTCTCG CAGCTAGCAC GCGAAGCCTC GCTCGTCCCC 
GTCACGCGCA CGATTTCGGC CGACCTCCTC ACTCCGGTTT CCGCCTTCCT TGCTCTGGCC
GACAAAGAGC CTTACGCCTT CCTGCTGGAA TCCGTCGAAG GCGGCGAGCG TATTGGACGC
TACACCTTTC TCGGCATCCG GCCCTACATG GTCGTCACCG GCCGCGGCAG CGAAGTTACG
ATACGCCGCG GTAAGAAGAC CGAAAAGTCG TCTTCCGATC TACTTGGAAC CCTGCGGGCC
GCGTTACGCG AGCACAAGCC CGCCACCGTC CCCGGATTGC CGCCCTTCAC CGCGGGCGCT
GTCGGCTACT TTGCTTACGA TGCCGTGCGC CACTTCGAGC GCCTGCCCGA CATCGCCAAA
GACGACCTCC ACCTTCCCGA CGGCGTCTTC ATGTTCTACG ACCGCCTGCT GGCCTTCGAT
CACCTGCGCC ACCAGTTGCA CCTCATCGCC GCCGCCGACG TCCGCACCGA GAAGCCGCGC
GCCGCCTACG ATCGCGCTAT CGCCGATCTC GATGCGCTGG AGAAAAAGCT CGTATCGGGA
CTGAAGATTC GTCGCCTGCG TCCCGAAAAG AAAACCGCGA AGATCAAGCT GCACGCCCGC
ACAAAGCCCG CCGACTACAT GAACGCCGTG AAGCGCGGCA AGGAATATAT CGCGGCAGGA
GATGTCTTCC AGGTCGTGCT CTCCCAGCGC CTCGACTTCG CACTGCCCGC GCCTCCCTTC
GACATCTACC GCTCTCTGCG CACGGTGAAT CCGTCGCCCT ACATGTACTT TCTGCGCATG
GACGACCTCC ACGTCCTCGG CTCGTCGCCC GAGATGCTGG TGAAAGCCAA CAACCGCACG
CTGGAGTACC GCCCGATCGC CGGGACCTAC AAGCGCGGCG CGACCGCCGA AGAAGATGCG
CGTCTCGAAG AGCACCTTCG CACCAACGAA AAAGAGCGCG CCGAGCATGT GATGCTCGTA
GATCTTGGAC GGAACGATCT CGGCCGCGTG AGCGAATACG GCTCTGTCAA AGTAAAAGGC
CTGATGTACG TAGAGCGCTA CTCGCACGTG ATGCATCTCG TCTCCGCGCT CGAAGGCAAA
CTGCGCGGCG ACCTCGACGC GCTCGACGCC TTCGCCGCCT GCTTCCCCGC CGGCACCCTC
AGCGGCGCGC CCAAAGTCCG CGCCATGGAA ATCATCGAAG AACTGGAACC CACCCGTCGC
GGCGTCTACG GAGGTTCGGT TTTGTATGCC GACTTCGCCG GCAATCTCGA CTCCTGTATC
GCCATCCGCA CCATGGTCGT GAAAAACAAC CGCGCGTATG TCCAAGCCGG CGCCGGCATC
GTAGCCGACA GCGATCCCGA AAGCGAATTC CAGGAGTGCC GCAACAAAGC GCAAGCGGTC
GTCCGCGCCG CCGAACTGGC GGGACGATAG
 
Protein sequence
MDSPDFKSFS QLAREASLVP VTRTISADLL TPVSAFLALA DKEPYAFLLE SVEGGERIGR 
YTFLGIRPYM VVTGRGSEVT IRRGKKTEKS SSDLLGTLRA ALREHKPATV PGLPPFTAGA
VGYFAYDAVR HFERLPDIAK DDLHLPDGVF MFYDRLLAFD HLRHQLHLIA AADVRTEKPR
AAYDRAIADL DALEKKLVSG LKIRRLRPEK KTAKIKLHAR TKPADYMNAV KRGKEYIAAG
DVFQVVLSQR LDFALPAPPF DIYRSLRTVN PSPYMYFLRM DDLHVLGSSP EMLVKANNRT
LEYRPIAGTY KRGATAEEDA RLEEHLRTNE KERAEHVMLV DLGRNDLGRV SEYGSVKVKG
LMYVERYSHV MHLVSALEGK LRGDLDALDA FAACFPAGTL SGAPKVRAME IIEELEPTRR
GVYGGSVLYA DFAGNLDSCI AIRTMVVKNN RAYVQAGAGI VADSDPESEF QECRNKAQAV
VRAAELAGR