Gene Acid345_3810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3810 
Symbol 
ID4071094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4504766 
End bp4506226 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content56% 
IMG OID637985833 
ProductD-beta-D-heptose 1-phosphate adenylyltransferase / D-alpha,beta-D-heptose 7-phosphate 1-kinase 
Protein accessionYP_592884 
Protein GI94970836 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2870] ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase 
TIGRFAM ID[TIGR00125] cytidyltransferase-related domain
[TIGR02198] rfaE bifunctional protein, domain I
[TIGR02199] rfaE bifunctional protein, domain II 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.325823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.269725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCA ACGTAGCGGA TCTCCTTCAT ACGATTAAGA ACAATTGGTG CGATCGCTCA 
ATCCTCGTAG TCGGCGATGT GATGCTCGAC CAATATATCT GGGGTGACGT AGGTCGTATC
TCACCGGAAG CGCCGGTTCC GATTGTGCGA GCTACGCATC GCACGGAACA ACCTGGCGGA
GCCGCCAACG TCGCGCTGAA TATCGCTCGG TTGGGCGCAC GTGCGACGAT TGTCGGATTC
ACCGGGACGG ACGATAACGA ACGGGCGCTG AAGGATTACC TTTCCTCGAA TCGGGTCGAG
GCTGATTTCG TGAGCTGCGA AGGCTTCCCA ACGATCACGA AATTACGAAT CCTATCTGGT
CGGCAACAAA TGCTTCGGCT CGACAACGAA CGAGCTGAGT CGCGGCCCAG TACTGCATAC
CAGAAGCTCA TCGAACGCGC TCTCCATCAC CTGCCGCAGA GCGATGCACT CATACTTTCG
GATTATGCGA AGGGCGTGTT GCTTCCGGAA GTATGTCAGA CACTCATACA AGCCGCCGCC
CAACGTAAGA TTCCGGTCCT CGTCGATCCC AAGAATGTCG ACTTCAGTAG GTACCGCGGG
GCGACAACAA TTTCACCGAA TCTGGGAGAG CTTGCGTTAG CGGCTCGCGT CGACCTTGAA
AATCTGAACG ATTTACTGTA TGCGGCGCAG CAGATGGTAC GCAATCTGGG ATTGAGCTTT
CTTACGGCGA CCCTTGGCGA AAAGGGAATC GCGTTAGTTA CGCGGGATAA GACGACAATT
TCGGCAGCGG TCGCTCGACA GGTATTTGAT GTCTCCGGTG CGGGAGACGC TGTTATTGCT
ACGTTGAGCT TGTCGCTAGC GTCGGGGCTG GATCCGGAGC TCGGCGTTCA CTTGGCGAAC
CTCGCTGGGG CAATCGTTGT AAGCAAAGTT GGGACGGCGC CAGTAGAACA GTACGAATTG
TTGAATGCTC TCACAGCGGA ATCAGTGCCG GTCGCCCAAG CAAAGGTCGT GACACGTTCG
GAGCTTCTCG AACTCGTGGC CCGCTGGAGG CGAAATGACG AGCGGATCGT CGTTACGAAC
GGCTGCTTCG ACTTGCTGCA CGTCGGCCAC ATATCGCTTT TGGAGCAGGC GCGCGGATTC
GGAGACCGAC TCGTCGTCGC GATCAATAGC GATCGGTCGG TGCGAGAATT AAAAGGGAAT
AGTCGTCCAA TTGTGGGAGA ACAAGAGCGC GCCCGAGTCC TCGCGGCTAT CGCGGCGGTT
GATGCAGTGG TCATATTCGA TGAGCGTACG CCTCTCGAAT TGATTGAGGC GACGCGCCCT
GACGTGCTGG TAAAGGGAGG AGACTATGCA GTGAGCGGAG TAGTGGGAGC TGAGGAGGTG
CAGTCCTGGG GCGGGCACGT AAAGATTGTT CCAATCGTTG AAGGCTTCTC GACAACAAAG
TTGATCGAAA AGGGGCACTA G
 
Protein sequence
MSTNVADLLH TIKNNWCDRS ILVVGDVMLD QYIWGDVGRI SPEAPVPIVR ATHRTEQPGG 
AANVALNIAR LGARATIVGF TGTDDNERAL KDYLSSNRVE ADFVSCEGFP TITKLRILSG
RQQMLRLDNE RAESRPSTAY QKLIERALHH LPQSDALILS DYAKGVLLPE VCQTLIQAAA
QRKIPVLVDP KNVDFSRYRG ATTISPNLGE LALAARVDLE NLNDLLYAAQ QMVRNLGLSF
LTATLGEKGI ALVTRDKTTI SAAVARQVFD VSGAGDAVIA TLSLSLASGL DPELGVHLAN
LAGAIVVSKV GTAPVEQYEL LNALTAESVP VAQAKVVTRS ELLELVARWR RNDERIVVTN
GCFDLLHVGH ISLLEQARGF GDRLVVAINS DRSVRELKGN SRPIVGEQER ARVLAAIAAV
DAVVIFDERT PLELIEATRP DVLVKGGDYA VSGVVGAEEV QSWGGHVKIV PIVEGFSTTK
LIEKGH