Gene Acid345_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0203 
Symbol 
ID4069672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp215221 
End bp216348 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content62% 
IMG OID637982203 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_589282 
Protein GI94967234 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.357554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.334206 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCT CCCAACCCGA ACTCCTCCGC TACAGCCGCC ACCTCTCGCT CCCCGAGTTT 
GGCCTGGAAG CCCAAGAGCG CCTGAAGCAA ACCAAGGTCC TCTGCATCGG CACCGGAGGC
CTGGGCTCCC CTCTCGCCAT GTATCTCGCC GCCGCGGGCG TAGGCACGCT AGGCCTCGTA
GATTTCGACA TCGTCGATTA CACCAACCTC CAGCGGCAGA TCATCCACAC CACGCCCGAC
GTCGGCCGCC CCAAGGTCGA ATCCGCCGCC GAAAAAATCG ACGTCCTCAA TCCGTTCGTC
AACGTCGTGC CCATCAACGC GAAGCTCACC AGCGCCAACG CCCTCGAGCT CTTCTCGCAA
TACGACATCA TCGCCGACGG CACCGACAAC TTCGCCACTC GCTATCTCGT CAACGACGCC
TGCGTTCTCA CCGGACGCCC CAACGTCTAC GCCTCCGTCT TCCGCTTCGA AGGCCAGGCC
AGCGTCTTCG CCACCAAGTC CGGCCCGTGC TACCGCTGCC TCTATCCCGA GCCGCCCCCA
CCCGGCACGG TCCCGAGTTG CGCCGAAGGT GGTGTCCTCG GTGTGCTCCC GGGCCTGCTC
GGCATCATCC AGGCCACGGA GGTCATCAAG GTCGCGTGCG GCATCGGCCA GCCGCTCATC
GGACGCATGC TGCTCGTCGA CGCCTCGACG ATGAAGTTCC AGGAACTGCG CCTCAAGCGC
GACTACCAGT GCCCCGCGTG CGGCACCCGT ACGCTCAAAG AGTTGATCGA CTATGATCAG
TTCTGCGGAA TCCGTGGACA GGAGAGCTCT GTGGCTGGAG ACATCACCGT AGAAGAACTA
AAACGCCGCC TCGACGCCGG CGAAAAACCT TTCATCCTCG ACGTGCGCGA GCCGCACGAA
TACCAGATCG CCAATCTTGG CGGCCACCTC ATCCCGCTCA ACGACCTCCC GAAGCGCATC
GGCGAACTCG ATCCCACCCA GGAAATCATT ACCCACTGCA AAATGGGCGG CCGCAGCCAG
CAAGCCGTCG ACTTCCTCCG CCAGCAAGGC TTCAAGAACG CGAAGAACCT AACCGGCGGT
ATCAACGCCT GGTCCGAAAA GGTCGATCCG AAAATTCCGA AGTATTAA
 
Protein sequence
MTLSQPELLR YSRHLSLPEF GLEAQERLKQ TKVLCIGTGG LGSPLAMYLA AAGVGTLGLV 
DFDIVDYTNL QRQIIHTTPD VGRPKVESAA EKIDVLNPFV NVVPINAKLT SANALELFSQ
YDIIADGTDN FATRYLVNDA CVLTGRPNVY ASVFRFEGQA SVFATKSGPC YRCLYPEPPP
PGTVPSCAEG GVLGVLPGLL GIIQATEVIK VACGIGQPLI GRMLLVDAST MKFQELRLKR
DYQCPACGTR TLKELIDYDQ FCGIRGQESS VAGDITVEEL KRRLDAGEKP FILDVREPHE
YQIANLGGHL IPLNDLPKRI GELDPTQEII THCKMGGRSQ QAVDFLRQQG FKNAKNLTGG
INAWSEKVDP KIPKY