Gene Acid345_3176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3176 
Symbol 
ID4071246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3766976 
End bp3769057 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content60% 
IMG OID637985196 
Productphage terminase GpA 
Protein accessionYP_592251 
Protein GI94970203 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.693881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCTG CTCCAATCAT CGCCACACAG GATCCCGTCC TCGCCGTCTT CCAGGAGGCC 
GCACAGGTCT TCCGGCCTGC ACCGGCGCTC ACCATCACCG AGTGGGCAGA GCGTCATCGC
ATTCTCAGTA CCGAGAGTTC GGTCAGTGCC GGCCTCTATC GTTGCGAGGT CACGCCCTAT
GCGCGCGAGA TGCAGGATGC CATTAAGGAT CCCGACGTCG AAGAGGTTGT CTTCTGGACC
GCCGCGCAGA TGGGCAAGTC CACTTCGCAG GAGAACATCG CGGCCTATTT CATCTGCGAA
GATCCATGCC CAATCATCTG GATGTGGCCG ACCAAAGAAG TCGCGCGTGA TTGGTCGGTC
GATACACTCG ATCCACTACT TCGCGACTCG CCCGAGTTGT CGCGCCGTTT CACTGAAGGC
TCGCGCAAAT CATCGAACCG CGGGCTCTTT AAAAAGTTTC CCGGCGGGTA TCTCTCCGCG
ATCGGTGCCA ACTCGGCGTC AGGCCTGCGG CGCCGTCGAG CCCGTCTGCT CATCTGCGAC
GAAATCGACG GCAACCCGCC CAGCGCCGGC GACGAAGGCG ATCCCATCGA GATTGTCATC
TCCCGTGCTG AAACTTTCTG GAACCGCAAG CGCGTCCTGG CTTCTACCTG CACCAATAAG
GGTGAGTCGC GGATCGAAGG CCGCTATGAG ATCAGCTCGA AAGGGAAGTA CTGGGTGCCA
TGCACGAGCT GCGGTGAACT CATGCTCCTT TCCTTCCGGG GCCTCAAGTG GCCGAAGGGA
GAAGAGCCCA CCATCGAGAA TACGTATCTT CCGTGCGAAC ACTGCGGCGT TGTGCTCACC
GAGGCCGATA AACCTGCCAT GCTCGCCGCC GGCCGCTGGA TCCACGAGCA TCCCGAGCGC
AAAATCCGCG GCTATTGGAT CAACAAGATG TATTCGCCCT TCGTGGCCTG GTGGGAACTC
GCAGCCAAGT TCAAGCGTCT GAACGCGCGT ACCACGGAAG ACCGCGAGGC GCTCAAGCCT
TTCGTCAATC TCGATCTCGC TGAGACCTGG GAGGTAAAGG ACGAAAAGCC CGATCGTAAT
CGCTTGGTCG ATCGCCGTGA GACCTACGAA ATACTCCGCG AGCAGCGCGA AGAAGGCGCG
CCCGCCAGCC AATCGAAGCT TGTCCAGGTC TCGCTCCTGC CGGATAGTGT CACCGTCCTT
ACCTGCTCCG TGGACGTGCA GGTCGATCGC CTTGAGTTCG AAATCGTCGG ATGGGGTCAC
AAGCGCGAAA GTTGGTCGAT CTATGTTGGC AACGTTCCCG GAGATCCCAA GAACGAAGCC
GTATGGCTGC GCCTCGATCA AATTCTCCAG ATGGAGCTGC AGCACCATCG CGGGTCCATG
CTGCCGATCG CGGCCACCTT CGTCGATTCC GGCTTCGACG CCCCCGAGGT CTACAACTTC
ACCAAGCCGC GCGCCTATCG CTGGGTGTTT GCCTCGAAGG GCTCGTCGGA GTTCAATCAC
GTCCCGCTGG CGAAGAAGAA GCACATCGAT CGCAGCAACG TGTGGCTCTA CCAGGTCGGC
GTCGGTCAAA TCAAAAAGAC GATCTATGCC AACCTCATGG TCACCGCGCC TGGGCCGGCG
TACATGCACT TCACCACCGC GCACAACACG CCCGAGTACT TCGATCAGCT CACCGCCGAA
ACTCTTGAGA GCTACTACGA GCACGGTTTT CCACGGAAGC GCTGGAAGAA GCAGCCCGGC
GCTCGCAACG AGGCTCTCGA TCTTCGGGTG TACAACTACG CCGCGTTCCT GTCGCTGAGC
GAGCAGCCCG ATAAGCTCCT CGATCGCCTG CGCGAGCAGT TGCTGCTCGA CGCGAAGAAA
CTGGAAGATG CGGGCGCGAA AGAGAACCAG TTGCCGCTCA TCTCCACCGA GCTCCCGCCT
TCCACGCCTC CCGAGCCCGT CGATCGCGCG GAAGCGACTG CAGCGACAGC CGAAAAGCTC
GCCGAGACGC TCACGGCCGC CCTCGCGCCT CCGCCTGTTG TAGCGCCTCC TTCTGAATCG
TTCACGCCAA GGGTCAAGGT CAAGCGCTCT AGTTGGCTCT AG
 
Protein sequence
MTSAPIIATQ DPVLAVFQEA AQVFRPAPAL TITEWAERHR ILSTESSVSA GLYRCEVTPY 
AREMQDAIKD PDVEEVVFWT AAQMGKSTSQ ENIAAYFICE DPCPIIWMWP TKEVARDWSV
DTLDPLLRDS PELSRRFTEG SRKSSNRGLF KKFPGGYLSA IGANSASGLR RRRARLLICD
EIDGNPPSAG DEGDPIEIVI SRAETFWNRK RVLASTCTNK GESRIEGRYE ISSKGKYWVP
CTSCGELMLL SFRGLKWPKG EEPTIENTYL PCEHCGVVLT EADKPAMLAA GRWIHEHPER
KIRGYWINKM YSPFVAWWEL AAKFKRLNAR TTEDREALKP FVNLDLAETW EVKDEKPDRN
RLVDRRETYE ILREQREEGA PASQSKLVQV SLLPDSVTVL TCSVDVQVDR LEFEIVGWGH
KRESWSIYVG NVPGDPKNEA VWLRLDQILQ MELQHHRGSM LPIAATFVDS GFDAPEVYNF
TKPRAYRWVF ASKGSSEFNH VPLAKKKHID RSNVWLYQVG VGQIKKTIYA NLMVTAPGPA
YMHFTTAHNT PEYFDQLTAE TLESYYEHGF PRKRWKKQPG ARNEALDLRV YNYAAFLSLS
EQPDKLLDRL REQLLLDAKK LEDAGAKENQ LPLISTELPP STPPEPVDRA EATAATAEKL
AETLTAALAP PPVVAPPSES FTPRVKVKRS SWL