Gene Acid345_4013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4013 
Symbol 
ID4071149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4741984 
End bp4744356 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content63% 
IMG OID637986040 
Producthypothetical protein 
Protein accessionYP_593087 
Protein GI94971039 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTCA TGAGGGGGGC ATTTCTGAAC GCAGTCCTGG CGACAGCAGT CACCGCCGGG 
TTCGCGTTCG CGCAGGACGC GCCACCGCCG CCGGATTCTC AGTATCCGGC GACGAGCAAT
TCTGCACAGA ATGATTCCGG ACAGTACGCC ACACCGGATT CCAACACCAT TCCGCAGTAT
TCCGCGCCGT CACCAAAGTA TTCGTCACCT TCGGCAGATC AGCCCGAGGA AGGCGGATAT
CCGCAATCGG CACAGTCGCA AAATGCGCCT CCGCCACCGG CAGACGCGCA GCAGGCGAAT
GCGAACGGTG AGGACAACGA CGACTCGCAG GACCCATCGC GCCGCGTAGC GCGTATGCAG
TTCATGGACG GGCAAGTTTC CATCCAGCCC GGCGGCGTGA ACGACTGGGT TGCGGGAACG
CTGAACCGCC CGATGACCAC CGGCGATAAC GTTTGGACCG ACCAGAATTC GAAAGCCGAA
TTGAACGTCG GTACCGGCAC GTTCCGCATG GGCGCGGAAA CCAGCGTCAC GCTGGCGAAT
GTTGCCGATA AGACCACGCA GTTGCAGGTG CACCAGGGCA CGCTGAATTT GCGCGTGCGT
CACCTGTACG ACGGCGAAAC CTACGAGATC GACACGCCGA ACATGGCGTT CACCGTGCAG
AAGCCCGGCG ATTACCGCTT TGACGTGGAC CCGAACGGCG ACACTTCGTT CGTCACGGTT
TGGAAGGGCG AAGGCAACGC CACCGGCGAC GGACCATCCG TAGCGGTGCG TCAGGGTGAA
AAGGCGAAGT TCTCGAATGG AACTTCGATG GCGTACACGG TGGATCGCGC GCCCGGACAA
GATGAGTTCG ACGAGTGGGC GGTCGCACGC GATCGCCACG ACGAGAATTC CACGTCGGCG
AAATATGTAT CGCCTGACGT GATTGGTTCG AGCGATCTCG ACGACTACGG CACCTGGAAG
AAGGACGACC AGTACGGAAA CGTCTGGATC CCGAACGACC AGAATGACAA CTGGCAACCC
TATAGCGACG GCAATTGGGC CTATCAGCAG CCATACGGCT GGACGTGGAT CGGCGCCGAG
CCTTGGGGCT TTGCTCCGTA TCACTATGGC CGTTGGGTGC AGGGCGGCTG GGGCTGGGGC
TGGACGCCCG GACCGTACGC TTACTGGGGC GCGCCGTATT ACGCTCCCGC ACTCGTCGGC
TGGTATGGCG GTGGTTTCGG TATCGGCATA GGCTTCGGCG GCGGCTGGGG TTGGTGCCCG
CTGGGGTGGG GTGAGGCCTA TCATCCTTGG TACCACCACG GACATTCGTA CTTCAATCAC
GTGAATGTGA CGAATACGCA CATCACCAAC ATCAACAACA TTCACAACAA CTACGGCAAC
CATGGCCAGC CGGCGAATTA TCGCCACGGC TTGGTGGTCG CGAATGGTAA GGCTGTCACG
AGCGGCATGA ACATTCGCAA TAACCGGATG AATGTGACCG CGCAGCAGCG GACCGCAATG
TTGCAGCACC CGGTCAACAA TCGCAGCCTC GGAAACGAGC TGCGTCCGAC GGCACAGAGC
CGCATCGGCG GCCAGACCCG CGCGTCCGTT GCACCTCCGG CGCGTACTGC GAACCGACCG
ACGTACTCGC ACCTTGCGCC GCCGGCACGT GGGCAGAATG GCAATGCGGT GAATGTACGC
GCCAACGGCG GCATTAACAC GACACGCCCG AGCGCCGGCT TGAACAACGG CCGCAACGGT
GTAGTCGCGA ACAACGGCCA TGCACCTGCG CCGATGACGC GCAACGTACC GCATCCGCCG
AGCGCTACTC CAGGCTCGTC CAACAACAGC CACTACGTTC CGCGTCCGCC GGCAAGTTCT
GGCCGGCAGA TGCAGAACGC ACCGGCAGCG ACCACGAATT CGCGTCCGGG AGGCACCTAC
GCAAGGCCGG GACAGAGCTA CTCTGCTCCG CCGAGCCAGT CACACTCGAG CCAGTCACAC
AATGTGCCGC GGATGAACGG CCCGGCGCAG CAGTCGTCGC GCAGCTACGC GGCGCCTCCG
AGCCGCAGCT ATAACAGCCC GAACTACGGG CGCAGCTACA GCCCAAGTCC CAGCTACGGG
CGTTCGTACA GCCCGAGCCA GGGCTACGGC CGTGCGCCGA GCTACAGCGC ACCGCACAAC
GGTGCACCCA GTGCCCAGCA GCATTACAGT GCACCGCATT ACAGTGCGCC AAGCGCGCCC
CACTACAGCT CACCGAGCTA CGGTGGTGGC CACGCTTCGG CACCGTCGTA CCACGGTGGC
GGCAGCTACG GGGGTGGTGG TGGTCACGCC AGCAGCGGTG GCGGCGGTCA CAGCAGCGGT
GGTGGTGGCG GCTCGCACGG TGGACATCAC TAA
 
Protein sequence
MRFMRGAFLN AVLATAVTAG FAFAQDAPPP PDSQYPATSN SAQNDSGQYA TPDSNTIPQY 
SAPSPKYSSP SADQPEEGGY PQSAQSQNAP PPPADAQQAN ANGEDNDDSQ DPSRRVARMQ
FMDGQVSIQP GGVNDWVAGT LNRPMTTGDN VWTDQNSKAE LNVGTGTFRM GAETSVTLAN
VADKTTQLQV HQGTLNLRVR HLYDGETYEI DTPNMAFTVQ KPGDYRFDVD PNGDTSFVTV
WKGEGNATGD GPSVAVRQGE KAKFSNGTSM AYTVDRAPGQ DEFDEWAVAR DRHDENSTSA
KYVSPDVIGS SDLDDYGTWK KDDQYGNVWI PNDQNDNWQP YSDGNWAYQQ PYGWTWIGAE
PWGFAPYHYG RWVQGGWGWG WTPGPYAYWG APYYAPALVG WYGGGFGIGI GFGGGWGWCP
LGWGEAYHPW YHHGHSYFNH VNVTNTHITN INNIHNNYGN HGQPANYRHG LVVANGKAVT
SGMNIRNNRM NVTAQQRTAM LQHPVNNRSL GNELRPTAQS RIGGQTRASV APPARTANRP
TYSHLAPPAR GQNGNAVNVR ANGGINTTRP SAGLNNGRNG VVANNGHAPA PMTRNVPHPP
SATPGSSNNS HYVPRPPASS GRQMQNAPAA TTNSRPGGTY ARPGQSYSAP PSQSHSSQSH
NVPRMNGPAQ QSSRSYAAPP SRSYNSPNYG RSYSPSPSYG RSYSPSQGYG RAPSYSAPHN
GAPSAQQHYS APHYSAPSAP HYSSPSYGGG HASAPSYHGG GSYGGGGGHA SSGGGGHSSG
GGGGSHGGHH