Gene Acid345_1735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1735 
Symbol 
ID4072002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2105186 
End bp2106292 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content61% 
IMG OID637983743 
Productglycine oxidase ThiO 
Protein accessionYP_590810 
Protein GI94968762 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCCC GTCGGCCAGA AGCATACACT ACGACCCTTT ACAATAAGGC CGTGCGCTCC 
CACGACGTCA TCGTCATCGG CGCCGGTGTC ATCGGCCTCT CGCTCGCCTG GAAGCTCCAG
AAATCCGGGC TTCGCGTTCT TGTTGTCGAG CGCGGCGAGC CCGGCCGCGA GGCCTCGTAC
GCAGCAGCTG GCATGATTGC CGTTTGCGAT CCGCACAACG ACCGCCTGAT GCAGCCGCTC
GCCGAGATCA GCGCCGCCAT GTACCCGGAG TTCGTCCATG AGCTCGAAGA CGAGTCCCAG
GAAAAAGTTG ATCTCCGCAA TGAAGGCGTG ATCACGTTCC TCGATCCGTC GTCATATTCG
GCGCTGCATC CGAAATCGAA AGCACTCACC GCAGAAGAAG TCGTAACTAT CGAGCCCGGC
GTCGTTCCGC GTGCGAATGC CTATTTCTTA CCCGAGAGCT GGGTCGATCC GCGGCTTCTC
TGCTCCGCAT TGCTGCGTGC TTTCAAACAT CGTGGCGGCG ACGTCGCCAG TGGTTCTCCG
GTTCTCTCAG TTACAGACAA TGAGGTTCGA ACCGAGCAGA CCATCTATCA CGCTGCAGCG
ATTGTGAACT GCGGCGGGGC GTGGGCCGGA CAGCTCATGC CCCCGAACCC AACCCGACCG
GTCAAAGGAC AAATGCTCTG TGTGGTTCCA CTCGCCCACC CCACGCCACA CCCGCCGGTG
CTGCGCCACG TCATCCGCGC CACCGATGTT TACCTCGTCC CTCGTAGCGA TGGACGAATC
ATTATCGGCT CCACCCTCGA AGAAGGGGGC TTCGACAAAC AAGTGAACCC GGACGTCATT
CAGCAGCTTC GTTCTTCCGC CGAATCCATC CTGCCGGCCG TGAAAGATAT GCGCACTCAT
GACGCATGGG CGGGCCTCCG GCCCGGCACT CCGGACGGAC GCCCCCTCCT CGGCAAATTG
GCGCCGAACC ATTACATCGC TGCCGGACAC TATCGTGATG GGATTCTTCT CGCGCCTGCG
ACCGCCGAGG TCATGGCCGC GCTTATTGAA GGTAGAACTC CGGAGATCGA TCTGCGCCCA
TTGGCTCCGA ATCGGTTCCA GCGCTGA
 
Protein sequence
MASRRPEAYT TTLYNKAVRS HDVIVIGAGV IGLSLAWKLQ KSGLRVLVVE RGEPGREASY 
AAAGMIAVCD PHNDRLMQPL AEISAAMYPE FVHELEDESQ EKVDLRNEGV ITFLDPSSYS
ALHPKSKALT AEEVVTIEPG VVPRANAYFL PESWVDPRLL CSALLRAFKH RGGDVASGSP
VLSVTDNEVR TEQTIYHAAA IVNCGGAWAG QLMPPNPTRP VKGQMLCVVP LAHPTPHPPV
LRHVIRATDV YLVPRSDGRI IIGSTLEEGG FDKQVNPDVI QQLRSSAESI LPAVKDMRTH
DAWAGLRPGT PDGRPLLGKL APNHYIAAGH YRDGILLAPA TAEVMAALIE GRTPEIDLRP
LAPNRFQR