Gene Acid345_0985 details


Gene Information

Locus tag: Acid345_0985
Symbol:
ID: 4068652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1250915
End bp: 1251814
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 65%
IMG OID: 637982992
Product: DNA-(apurinic or apyrimidinic site) lyase / formamidopyrimidine-DNA glycosylase
Protein accession: YP_590062
Protein GI: 94968014
COG category: [L] Replication, recombination and repair
COG ID: [COG0266] Formamidopyrimidine-DNA glycosylase
TIGRFAM ID: [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg)
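
The length fields in this table can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1250915
END_BP = 1251814
GENE_LENGTH_BP = 900
PROTEIN_LENGTH_AA = 299


def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, length == GENE_LENGTH_BP)               # 900 True
    print(protein_length(length),
          protein_length(length) == PROTEIN_LENGTH_AA)    # 299 True
```

Under these assumptions the reported gene length (900 bp) and protein length (299 aa) agree with the start and end coordinates.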


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000818098
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCGGAAC TCCCCGACAT CACCGCGTAC CTCACCGCGC TAGAACCGCG CGTGCTTGGC 
AAAACGCTCC AGCGCGTCCG CATCACCAGC CCCTTCCTCC TCCGCACCAT CGATCCGCCG
CTGGAAGCCG TGGAAAGGAA GCGCGTGCTC GCGTTGCGGC GCATCGGCAA ACGCATCGTC
TTTGGATTGG AAGACGATCT CTGGCTGGTC TTGCACCTGA TGATCGCCGG CCGCCTTCAC
TGGAAAGCCG CGGGCGTCGC GCTGAAAGGC CGAAATTACC TTGCCGCGCT GGATTTCGAC
GACGGATCGC TGGTACTCAC CGAGGCCGGC GCGAAACGAC GCGCGTCCTT GCATCTTTTC
CGCGGCGAAG CCGCGCTGCG CACCGTGGAT CCCGGTGGAA TTGACGTCTT CACCGCCGAT
CTCGACGCAT TTCGCGCGGC GCTCACCATA GAAAACCGCA CCTTGAAGCG CGCCCTCACC
GATCCGCGCT TCCTCAGTGG CATCGGCAAC GCCTATTCCG ACGAAATCCT CTGGGCCGCG
CAGCTCTCAC CCATCGCGCA AACCCACAAG CTGAAATCCG ACGAATGGCA GCGCCTCTAC
GACGCCACCC GCGCGACGCT GCAAACCTGG ATCGACCGCT TCGCCGCCGA GGCCGCGAAG
AAATTCCCGG AGAAAGTCAC CGCGTTCCGT CCGGAGATGG CCGTCCACGG CAAGTACGGC
GAGCCCTGCC CGCGCTGCGG CGAGAAAGTC CTACGCATCC GCTACGCCGA CAACGAAACC
AACTACTGCG CGCGTTGCCA GACCGGCGGC CGCGTCCTCG CCGACCGCAG CCTATCGCGC
CTTCTCGGCT CCGATTGGCC GCGCACTCTC GACGAACTCG AGGCCCTCAA GCACCGCTAG
 
Protein sequence
MPELPDITAY LTALEPRVLG KTLQRVRITS PFLLRTIDPP LEAVERKRVL ALRRIGKRIV 
FGLEDDLWLV LHLMIAGRLH WKAAGVALKG RNYLAALDFD DGSLVLTEAG AKRRASLHLF
RGEAALRTVD PGGIDVFTAD LDAFRAALTI ENRTLKRALT DPRFLSGIGN AYSDEILWAA
QLSPIAQTHK LKSDEWQRLY DATRATLQTW IDRFAAEAAK KFPEKVTAFR PEMAVHGKYG
EPCPRCGEKV LRIRYADNET NYCARCQTGG RVLADRSLSR LLGSDWPRTL DELEALKHR
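
The relationship between the two sequences can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is given 5' to 3' on the coding strand, starts at the initiation codon, and contains only A, C, G and T. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code but accepts alternative start codons such as GTG, which is why the leading GTG is read as M. The helper names (`clean`, `translate_cds`, `gc_content`) are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 shares these assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first codon up to the first stop codon.

    Assumes the sequence begins at the start codon, so the first codon is
    reported as M even when it is an alternative initiator such as GTG.
    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = ("GTGCCGGAAC TCCCCGACAT CACCGCGTAC "
                  "CTCACCGCGC TAGAACCGCG CGTGCTTGGC")
    print(translate_cds(first_line))  # MPELPDITAYLTALEPRVLG
    print(gc_content(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full 900 bp gene sequence to the same functions would allow the complete protein and the reported GC content to be checked in the same way.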