Gene Acid345_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1077 
Symbol 
ID4070037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1351345 
End bp1352664 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content60% 
IMG OID637983086 
Productguanosine polyphosphate pyrophosphohydrolases/synthetases 
Protein accessionYP_590154 
Protein GI94968106 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.163028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0154714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAC TGATTCAATC GGCCAAACAG TTTGCGCTGC AGCAAACCGA GCGTATTAGC 
CAAAGCCGGC ACCCCAAGCA GCAGACCGCC GAGAACCATC TCAAAGCAGT TGCGCAAAAC
GTCGCCTCGG TCACTTCCGA CCGCAACGCA ATCGCTGCCG CATGGCTTCA CGACATCGTG
GGAGACACTC CTGTCACCCT CGGTATGATC GAGCGACGCT TCGGTGCCGA TATCGCGCGA
CTCGTGCATG AACTCACGCC GGTAAGTCGG CCTGGTGATG GCGACCGCGC TGCGCGTTTT
GCCAAAGACA AGCGCCACTT CGCCGCAATC TCACCAACCG CGAAACTGGT GAAACTGGCC
GACATGATCG ACACCTGCCG CGACCTGCGC GCCGCTGATC CAGCAGTCGC TCGCCCCTTT
CTTCTCGAGG TTACAGAGCT GCTTCCGGTG CTCGAAGACG GCGACATTCG TCTTGCTGCC
CGTCTTCGGA AAGAACTGCA AAGAGCTCCC AAAACCTTGG GTGATGTCGA AGCAACACCT
CCCCCGCGGC TGGAGCCTCT CGCCATTTCG CTCAACGCTC TACGGGTTTT CGAGCGCGCC
TTCAGCGCCT GGGACATCGC CGATCCCCTG CTCCTGTTCC ATGCGGATGC CGACGCCGCG
GAATGTCGTC ACGAGATCGA AGCAGCGCGA GAAGAAGTCG CCGCGGTGTG GCAAGACGGC
GCTCTGCGCG GATACGTCAC CGGCTCTGAA TTGAAGGAAG GAACCTGCGC CGGTTACGTG
CGGGCCATCG CGCCGGACCA GTTGCTCGAT GCCGACGGTT CACTCACCGA CGCCATTGAA
ATCCTGACGC GTTACGATGC CTGCTTCGTA ACTTGGGACG GCGAGCCAAG GGGCGCCATT
ACACGGGTTG ACGCCCATAA GCCTGCGGTG CGCATGTGGC TCTTCGGGAT CATCACCGTC
ATCGAAATGG AGTTCACCGA ACGCGTTCGC CAGCAGTGGC CGGCAGGTGG TTGGTCCACG
CTAGTCTCGG CAGGCCGTTT GGAGAAGGCG CGGCAACTCT TTGCCGAGTG TACGCGTCGC
CACGAAAAAT GCGAGCTCCT CGATTGCCTG CAACTCGGCG ACAAAATTCA GATTCTCATC
AGCGACCCCG CATCTCTCGC ACTCATCGAC ATTCCCACTG CAAACGCAGC CAAACGCATC
ACTGCGCAGA TTGAATCATT ACGGAATAAA CTTGCGCATT CACAGGATTT TATCGATCAG
GACTGGCCAC AGGTGGTGCG TCTCGCGCGC CGCGTTGAGC ACATGGTGCA GCAGTTCTAG
 
Protein sequence
MPELIQSAKQ FALQQTERIS QSRHPKQQTA ENHLKAVAQN VASVTSDRNA IAAAWLHDIV 
GDTPVTLGMI ERRFGADIAR LVHELTPVSR PGDGDRAARF AKDKRHFAAI SPTAKLVKLA
DMIDTCRDLR AADPAVARPF LLEVTELLPV LEDGDIRLAA RLRKELQRAP KTLGDVEATP
PPRLEPLAIS LNALRVFERA FSAWDIADPL LLFHADADAA ECRHEIEAAR EEVAAVWQDG
ALRGYVTGSE LKEGTCAGYV RAIAPDQLLD ADGSLTDAIE ILTRYDACFV TWDGEPRGAI
TRVDAHKPAV RMWLFGIITV IEMEFTERVR QQWPAGGWST LVSAGRLEKA RQLFAECTRR
HEKCELLDCL QLGDKIQILI SDPASLALID IPTANAAKRI TAQIESLRNK LAHSQDFIDQ
DWPQVVRLAR RVEHMVQQF