Gene Acid345_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3937 
Symbol 
ID4071320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4656455 
End bp4657885 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content59% 
IMG OID637985963 
Producthypothetical protein 
Protein accessionYP_593011 
Protein GI94970963 
COG category[R] General function prediction only 
COG ID[COG1660] Predicted P-loop-containing kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.93988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCC TCGCCCTCCT CTTCGAGCAG CACTTCGGCA GTGCACCCAC GCGCATGCAT 
CCCGTGCAAG GCGGACTCGG CGGCTCCGGA CGCATCATTA CCCGCCTCGC CAACGACACG
CATTCGGCCA TCGGCATCCT CAACGAAAAC ACCAAAGAGA ACGCTGCCTT CCTCGAGTTC
TCTGACCATT TCCGGAAGTA CGACCTCGCG GTCCCCGAAA TCTATCGCGT AGCCGAGACT
CGCAACGCGT ACCTCGAACA AGATCTCGGC GACACTACGC TCTTCCACTT CCTCGCGCGG
AACCGTAGCG GGTCTGAGAT CGCTCCCGAG GCCGTCAACG CCTATCGCAA AGTTGTGGAA
GCTCTGCCAC GCTTCCAGGT CGTCGCCGGG CGCGACCTCG ACTACTCCGT CTGCTACCCG
CGCCCGAGCT TCGACCGCCG CTCCATCGCG TGGGACCTGA ACTATTTCAA GTACTACTTC
CTTAAGCTGT CTGAAATTCC GTTCCACGAA GAGGCGCTCG AAGAAGATTT CGACAAGCTC
ACCGAATACC TTCTCAGCGC TCGGCGCGAT TACTTCCTTT ACCGCGACTT TCAATCACGC
AACGTCATGC TGCACGACGG CCAGCCCTAT TTCCTCGATT ACCAGGGTGG ACGCCACGGC
GCGCTGCAGT ACGACATCGC TTCGCTCCTC TTCGACGCGA AAGCCGAGTT GCCACCCGCT
TTGCGCGAAG AGCTGCTCAA TCACTATCTC GACGCACTTG CCGAGCACAT CCCTGTCGAT
CGACAGGACT TCCTCGCGCA TTACTATCCC TACGTTTACA TCCGCATCAT GCAGGCACTC
GGGGCCTACG GCTACCGCGG CTTCTTCGAG CGCAAAGTGC ACTTCCTGCA AAGCGTGCCG
TATGCGTTGC AGAACATCCG GTGGCTGCTC CACAACGTAA CGCTGCCGAT CGAATTGCCT
GCATTGATGG AAGCCTTCTC CGCCATGCTC GGCTCGGAAA AACTGCAGAA GCTCGCGATC
ACCGAGAAGA AGGAGCTCAC GATCGTCGTC ACCAGTTTCT CTTTCCATCG CGGACCGGTG
CAGGATGAGA GCGGCAACGG CGGCGGCTTT GTCTTCGACG CCCGTGCCCT CCCCAATCCT
GGACGCGAGG AGCAATTCAA GAAGCTCAGT GGCCGCGATG CCGAAGTGAT CGAATATCTT
GAGGCCGAAG AATCTGTCAG CCAATACCTC GAGAACGCGA TGAACATGGT CAACGCCAGC
GTGCGCGCCT ACAAAAAGCG CCGCTTCACC CACCTGATGG TTTCGTATGG CTGCACCGGC
GGCCAGCACC GCTCGGTCTA TCTCGCCGAG CAGACGGCGA AACGACTCGC CGGAATTGAC
GGATTAAAAG TCATTCTGCG CCACCGCGAA GAGGAGAGTT GGGTCCGATG A
 
Protein sequence
MDTLALLFEQ HFGSAPTRMH PVQGGLGGSG RIITRLANDT HSAIGILNEN TKENAAFLEF 
SDHFRKYDLA VPEIYRVAET RNAYLEQDLG DTTLFHFLAR NRSGSEIAPE AVNAYRKVVE
ALPRFQVVAG RDLDYSVCYP RPSFDRRSIA WDLNYFKYYF LKLSEIPFHE EALEEDFDKL
TEYLLSARRD YFLYRDFQSR NVMLHDGQPY FLDYQGGRHG ALQYDIASLL FDAKAELPPA
LREELLNHYL DALAEHIPVD RQDFLAHYYP YVYIRIMQAL GAYGYRGFFE RKVHFLQSVP
YALQNIRWLL HNVTLPIELP ALMEAFSAML GSEKLQKLAI TEKKELTIVV TSFSFHRGPV
QDESGNGGGF VFDARALPNP GREEQFKKLS GRDAEVIEYL EAEESVSQYL ENAMNMVNAS
VRAYKKRRFT HLMVSYGCTG GQHRSVYLAE QTAKRLAGID GLKVILRHRE EESWVR