Gene Acid345_3885 details

Gene Information

Locus tag: Acid345_3885
Symbol:
ID: 4072220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4594076
End bp: 4595119
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 62%
IMG OID: 637985909
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_592959
Protein GI: 94970911
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
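
The coordinates and lengths in this record agree with each other, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information record.
start_bp = 4594076
end_bp = 4595119

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.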


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGG ACGCCCTACA TCAGATAGTC GTTCACCGGC GGGACCTTAC GCGCGAACAG 
GCGCGCGAAA CCATGGCTGA CGTCCTGGGA GGAAAAACCA CCGACGCGCA GATCGGGGCG
TTGCTCGTCG GGTTGCAGAT GAAGGGCGAG ACGGTGGATG AGATTGTCGG CTTCGCGGAG
GCGATCCGTG CGGCCGCGAC GCCGTTGATC GTGCGCGACT CAGCGCTCGA CGTGAGCGGC
ACCGAGCGCG ATGCGCTGCT CGATACCTGC GGCACCGGGG GCGATGCCAG CGGGACGTTC
AACATCTCGA CGGCGACGGC ATTAGTTGTG GCGGGTGCGG GCGTGAAGGT GGCGAAACAC
GGCAACCGTA GTGTGACTTC GAAGTGTGGG TCGGCGGATG TGGTCGAGGC GCTGGGAGTG
AACATCAACC TTCCGGCAGA ACGCATGGCG GAGTGCCTGG AGAAAGTCGG GATCGCGTTC
CTGTTTGCGC CGGCGATGCA CACGGCGATG AAGTATGTGC AGCCGGCGCG GCGTGAGTTG
AAGATGCGCA CGGTGTTCAA TCTGCTGGGA CCGCTCACGA ACCCGGCGAA TGCTTCATGC
CAGGTTGTAG GTGTGTACAC AGGGCAGCTT GTTGAGAAAC TGGCGCAGGC CCTCTTACAG
CTTGGGTTGA AGCGCGCGCT GGTGGTACAT GGGTGGGATG GACTGGATGA GATCACGATA
TCCGGCCCGA CGAAAGTTGC GGAAGTACGC GATGGAAAGG TGACATCGTA CGAGATTTCG
CCCGAACAGT TTGGACTGCA ACGCGCGCCG CTGAGTGCGC TCGAGGGCGG CGATGCGCAG
GTCAATGCTG CGATCATTCG CGCGATTCTT GATGGCGAGC GGTCTCCGAA GCGCGATGTT
GTGCTGCTGA ATGCTGCCGC GGCACTGGTG GCGGCGGGTC AGGCAGAGAC GATGGGAGCG
GCGATTCCCG TTGCGGCGTA TGCGATTGAT AGTGGGCAGG CGAAAGGGAG GCTGCGGTTG
CTGGTGGAGT TTACGAACCT ATAG
 
Protein sequence
MITDALHQIV VHRRDLTREQ ARETMADVLG GKTTDAQIGA LLVGLQMKGE TVDEIVGFAE 
AIRAAATPLI VRDSALDVSG TERDALLDTC GTGGDASGTF NISTATALVV AGAGVKVAKH
GNRSVTSKCG SADVVEALGV NINLPAERMA ECLEKVGIAF LFAPAMHTAM KYVQPARREL
KMRTVFNLLG PLTNPANASC QVVGVYTGQL VEKLAQALLQ LGLKRALVVH GWDGLDEITI
SGPTKVAEVR DGKVTSYEIS PEQFGLQRAP LSALEGGDAQ VNAAIIRAIL DGERSPKRDV
VLLNAAAALV AAGQAETMGA AIPVAAYAID SGQAKGRLRL LVEFTNL
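
Both sequences are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of that step, together with a GC fraction calculation for a nucleotide sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used to keep the example short.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two blocks of the gene sequence, used only as a short example.
fragment = clean_sequence("ATGATCACGG ACGCCCTACA")
print(len(fragment), gc_fraction(fragment))
```

Passing the full gene sequence through the same two functions gives a length and GC fraction that can be compared with the Gene Length and GC content fields in the record.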