Gene Acid345_2477 details

Gene Information

Locus tag: Acid345_2477
Symbol:
ID: 4072101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2933234
End bp: 2934250
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 60%
IMG OID: 637984494
Product: UBA/THIF-type NAD/FAD binding protein
Protein accession: YP_591552
Protein GI: 94969504
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
TIGRFAM ID:
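
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 2933234
END_BP = 2934250
STOP_CODONS = 1  # assumption: the CDS ends with exactly one stop codon


def cds_lengths(start: int, end: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for inclusive coordinates."""
    gene_len = end - start + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the stop codon contributes one residue.
    return gene_len, gene_len // 3 - STOP_CODONS


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1017 338
```

Under these assumptions the result agrees with the listed gene length of 1017 bp and protein length of 338 aa.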


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.509569
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGTTCC AAGAACGGTA CTCCCGGCAA ATTCTGTTCC ACGGCATTGG GGCTGAAGGG 
CAGCAGAGGC TGGCCGCGGG ACGGGCAGTA ATCGTCGGTT GCGGAGCAAC CGGTTCGGCG
CTGGCCTCCC TTTTGGCGCG CGCCGGCGTG GGCTATTTGC GAATCGTGGA TCGCGATTAC
GTAGAGCCGA GCAATCTGCA AAGACAGGGT CTGTTCGACG AGAACGATGC CGCCGAGGCG
CTTCCGAAGG CCATCGCGGC AGCGCGAAAA ATCCATGCGT TTAACAGCGA GATCACGGTT
GAACCTCATG TGGATGACCT GACTCCCGAC AACGCCGACG ATCTACTCGC GAACGTGCAA
TTGATCCTCG ACGGAACCGA CAACTTCGAG ACGCGCTATC TGATTAACGA CTACGCCGTT
AAGAACGCCG TGCCGTGGAT CTACTCCGCG GCCGTAGGCA GCTACGGCGT GGCAATGAAT
ATCCTGCCCG GCGAAACGGC TTGCCTGGCA TGCGTTTTCC CCGATTCGCC GCGCGGTGTG
GTCGAGACGT GCGATACCTC AGGAATCTTG AACACCGCTG TGAACGAAGT GGCATCGCTC
TCAGCGACGG AAGCGTTGAA ATATTTTGTC GGCGCCCGGG AGAAGATGCG ACGGACGCTG
GTCTCAACCG ATGTCTGGAC CAATGAACGG TCGGAGATTC GGACCGGCGG ACCGAAACCT
GGCTGCCGGT GCTGCGGGAA GCGCGACTTT AGCCACCTGT CCGGGGAGGG GCGTCCGCAT
ATTTCCTTGT GCGGACGCAA TTCGGTGCAG ATCCATGAGC GGCAGAGGCC GATCGACTTC
GCGATGATGG AGACACGACT GCGGCCGCAT GGCCAGGTAC GCCACAACGA ATTCGCGCTG
CGGTTTTTCC ACGAACCTTT CGAGATGACC CTGTTCCCGG ACGGACGGGC GATCATTAAG
GGGACGACGG ATATTGGCGT GGCTCGAAGT TTGTATGCGC GGTTCGTGGG ATCGTAG
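
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to strip the blocking and compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical. It runs only on the first row as an illustration. The record's 60% figure presumably refers to the full 1017 bp sequence, so the value for a single row need not match it.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "GTGCAGTTCC AAGAACGGTA CTCCCGGCAA ATTCTGTTCC ACGGCATTGG GGCTGAAGGG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

To check the whole gene, pass all sequence rows (joined with newlines) to `clean_sequence` instead of `first_row`.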
 
Protein sequence
MQFQERYSRQ ILFHGIGAEG QQRLAAGRAV IVGCGATGSA LASLLARAGV GYLRIVDRDY 
VEPSNLQRQG LFDENDAAEA LPKAIAAARK IHAFNSEITV EPHVDDLTPD NADDLLANVQ
LILDGTDNFE TRYLINDYAV KNAVPWIYSA AVGSYGVAMN ILPGETACLA CVFPDSPRGV
VETCDTSGIL NTAVNEVASL SATEALKYFV GAREKMRRTL VSTDVWTNER SEIRTGGPKP
GCRCCGKRDF SHLSGEGRPH ISLCGRNSVQ IHERQRPIDF AMMETRLRPH GQVRHNEFAL
RFFHEPFEMT LFPDGRAIIK GTTDIGVARS LYARFVGS
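
The gene begins with GTG while the protein begins with M, which is consistent with translation table 11 (the bacterial code), where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases. It is not the database's own pipeline: `translate_cds` is a hypothetical helper, and `START_CODONS` lists only a commonly used subset of the table 11 initiators.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same amino acid
# assignments and differs in which codons may initiate translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a commonly used subset of table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiating start codon as M and stopping at '*'."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is decoded as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate_cds("GTGCAGTTCC AAGAACGGTA CTCCCGGCAA"))  # MQFQERYSRQ
```

The output matches the first block of the listed protein sequence. The gene's final codons (GGA TCG TAG) likewise give the terminal residues GS followed by a stop.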