Gene Acid345_3011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3011 
Symbol 
ID4071566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3570182 
End bp3571234 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content58% 
IMG OID637985030 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_592086 
Protein GI94970038 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000165502 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
TTGACTCAAG CCGACATGGA TACCAAGCCC GTTACGTACG CCGACGCCGG CGTTGATATT 
GAAAAAGCCA ACCGCACCAA GCAGCGCATT AAGTATTTGG CGCACAAGAC GTTCACCAAG
AGCGTCCTGA GCGAGATTGG CGGCTTTGGC GGCCTTTTCC AGATCGATAA AAAGAAGTAC
CTGGACCCGG TGCTCGTTTC GAGCGTGGAC GGCGTAGGCA CGAAGCTGAA AATCGCATTC
GAAATGAACC TTCACCACAC GATTGGTGCG GACCTGGTCA ACCATTGCGT GAACGACATC
GCGGTGCAGG GCGCGGCGCC GATGTTCTTC ATGGACTACC TGGCAACCGG CAAACTGGAT
CCGGACATTG CGGAGAGGAT CGTTACCGGG CTCGCGGATG CTTGCAAGCA CAATGGCTGC
GCGCTGATCG GCGGCGAGAC GGCCGAGATG CCGGGCTTCT ATCCCGACGG CGAATACGAT
CTCGCTGGAT TCATCGTGGG AGTGGTCGAA CGCGATAAGG TCATCACTGG CAAAGAGGTT
GTGCCGGGAG ATGTGCTGGT CGGGCTGCCG TCGAATGGGC TGCATACGAA CGGATATTCG
CTCGCCCGGA AACTGCTCTT CTCCATCGCC GGATACTCGC CCGAAACGTA TGTAAATGCG
ATTAAAGGCA AGGTCGGCAA CGAGCTGATG AAGACGCACA AGAGCTACTG GCCTGCGGTC
CGTCGGCTGG TGGAGGCGGA GTGCGTAAGC GCGATGGCAC ACATTACAGG CGGCGGCATT
ACCGAGAACC TGCCGCGCGT GCTGCCCAAG GGCACGGGCG CGGTGGTGGA ACTGGGATCG
TGGCCGGTGC TGCCGATCTT TACCCACATG CAGCAGCTCG GGAATATCAG CCAGGACGAG
ATGCTCCGCA CCTTCAACAT GGGTATCGGG ATGGTGCTGG TGATTCCGGC GAAGAAGTTC
AAGAAGGTAC AGACAGTGCT GGAGCGCGCT GGGGAGAAGG GCTATACGAT CGGGCGCATT
GTGAAGGGCG ACCGAAAAGT CAGCTACTCG TAA
 
Protein sequence
MTQADMDTKP VTYADAGVDI EKANRTKQRI KYLAHKTFTK SVLSEIGGFG GLFQIDKKKY 
LDPVLVSSVD GVGTKLKIAF EMNLHHTIGA DLVNHCVNDI AVQGAAPMFF MDYLATGKLD
PDIAERIVTG LADACKHNGC ALIGGETAEM PGFYPDGEYD LAGFIVGVVE RDKVITGKEV
VPGDVLVGLP SNGLHTNGYS LARKLLFSIA GYSPETYVNA IKGKVGNELM KTHKSYWPAV
RRLVEAECVS AMAHITGGGI TENLPRVLPK GTGAVVELGS WPVLPIFTHM QQLGNISQDE
MLRTFNMGIG MVLVIPAKKF KKVQTVLERA GEKGYTIGRI VKGDRKVSYS