Gene Acid345_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3547 
Symbol 
ID4069279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4196927 
End bp4198750 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content58% 
IMG OID637985570 
Producthypothetical protein 
Protein accessionYP_592622 
Protein GI94970574 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTATC CCGCGTCTGT GTTTGGGCCT GCCGTACGTA ACCAAGTTCT CCGTTCCCTT 
TTATTGATTG CTCTCATTCC GGCTGCTCTT TTCTTGAGCG CCTGCGGCGG CAGTAGTTCC
AGCACGACAG CAACTACCGG AGTCGCGCCA GTCTTCACCA GCACGGCTCC GACCATCGCG
CGCGAAGGCG TTCTGTATAC CTACGACGTC ACCACGACGA CGTCGGACGG CAGCACGGTG
ACCTACGCGG CAACCACTGT CCCGAGCGGC GCCACCTTCG ATGGCGCCAC CTTGAAGTGG
ACGCCTACCC ACGCCGAATC GCGTATCTCG AACTCATTCA CAATCACTGC TACCACCAGC
AATAACGGAA CCGCCACGCA GTTCTTCAGC GTCACGCCAA ATGGCAATAT TGACGGTACC
GCCGTTGATC ACGCCGTCAC GGGCAGTGGC TTGAAAAATT ACAACCAGGA CCTCAGCGGT
TCTGTCGTCG AAGCCCTCGT TCCCGATGGC AAGGGTGGCT ACAACACGGT GAGAGGATCG
GGCAAGGATG ACGGCACCTT CAGTGTGGGC AACATCGGCA CCGGCAGCTT CTGGCTGCAT
GTGCAGCAGC CGGAGGTCGG TACTCTCCAG GACAATTACA TCTGGACCAA CGCCAGTGAT
GTCGACCTCG GCATGCTGCT CGGCCAACGA CCGGATGTTG TGCAGGAGAA GCTCGGCCAG
ACCATCACGA CAAGCTTCGA TCTCGCCGTT GCCCCTAAGA GCGAAGACTC TCTCGCGTGG
GCAAGTCCAG ATGCAGGTGC TTTTGGAAAC GGCCTGCCGA CTTCTTTCAC GCAACATCTT
GTGTCAACGT TCCCGCAGTC GGGCGGCCTC ATCGATAGTG CCAAGGGAGA TCGCGGCTTC
TTTGTTCATT ACTCGCCAAC GTCCACTGGC CTCGGCGTAG CCGTTGATGC TGCCGAATAC
GACAGTATTA CGCAGACTGA TGGCGGAACA ACGAACCTGA CCACCAATAC CGCTGCGCTC
AGCGGAACCA GCACCGCCAA TCCGGTGATC AAGATCACGC AGTGGGACGC GCTGTATGCG
GGTCTGCCTG GGGTTACCCC TCTTCTAAAA GAGTTCGACT TCTACGACGC ACGCTATCCC
GGAACGGAAG GCCCCGCGGG CGGTATCGAC ATTGCCTATG GTCCAGACCT GCGAAACGTC
ACCACGGACA CAGACCTGGG CAGTTTTAGC TATGCCATGA TTTCCAAGAC CGGCGTTCCC
TACACCCAGT TTCTCGACTA CGGCCTCCGC ATCATTAATG TCGGTTCGAG CAATTTTGAA
TTTGTCGTCG GCGGCGCAAT CTTCACCAAC GCAGTGCCGA CCTCTGCCAC TCCGATCGTT
CCGGTTATAA GTTTGCCGCG CTCCGTAACC GTGGATGGCA AAGATTTCCT GAGCGACCAG
ACCAACATTT CGTTGAGCCC ACAAATCTCG TGGTCCACTC CGTCGACGGG CACTCCAACG
TCGTACGCTT TGTACGTCTA CGACACAAGC AAGTTCAACG CGATCGCGTC GTTTTATACC
AATGGAAACA GCGTGACTGT GCCGGCCGGG ATGCTGCACG CCGGCTCCAC CTACATCTTT
TATTTGGAGG CTTTCCTGTC CCAGAGTACG ACGTTTGCAA CAGCGCCGTT TCGCACAGGA
ACCAGCCAGG CAATCTCGTT TGTCGTTTCC GGCATAATGA CCACGGCCGG AGGCGCCAGT
GCGTCAGGCG TGCCCTCCGA GACGAAACAG AAGTTTAGGG TTACGCCTCG TTTCGTGGGA
GCACCGAAGG TTGCAAAGCA ATAG
 
Protein sequence
MPYPASVFGP AVRNQVLRSL LLIALIPAAL FLSACGGSSS STTATTGVAP VFTSTAPTIA 
REGVLYTYDV TTTTSDGSTV TYAATTVPSG ATFDGATLKW TPTHAESRIS NSFTITATTS
NNGTATQFFS VTPNGNIDGT AVDHAVTGSG LKNYNQDLSG SVVEALVPDG KGGYNTVRGS
GKDDGTFSVG NIGTGSFWLH VQQPEVGTLQ DNYIWTNASD VDLGMLLGQR PDVVQEKLGQ
TITTSFDLAV APKSEDSLAW ASPDAGAFGN GLPTSFTQHL VSTFPQSGGL IDSAKGDRGF
FVHYSPTSTG LGVAVDAAEY DSITQTDGGT TNLTTNTAAL SGTSTANPVI KITQWDALYA
GLPGVTPLLK EFDFYDARYP GTEGPAGGID IAYGPDLRNV TTDTDLGSFS YAMISKTGVP
YTQFLDYGLR IINVGSSNFE FVVGGAIFTN AVPTSATPIV PVISLPRSVT VDGKDFLSDQ
TNISLSPQIS WSTPSTGTPT SYALYVYDTS KFNAIASFYT NGNSVTVPAG MLHAGSTYIF
YLEAFLSQST TFATAPFRTG TSQAISFVVS GIMTTAGGAS ASGVPSETKQ KFRVTPRFVG
APKVAKQ