Gene Acid345_1624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1624 
Symbol 
ID4072550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1968707 
End bp1971871 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content58% 
IMG OID637983633 
Producthypothetical protein 
Protein accessionYP_590700 
Protein GI94968652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.6016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTCGTT TGCGCAAGCA CTTTTCGGTC CTGTTGTTCT CACTCCTCTC TTCACTCGTC 
ATCGCGCAAC AACTCCCGGA TATGTCTCAA GCATGGCACT GGCGTCCCAT TGGCCCACTC
CGTGGCGGAC GCACGCGTTC TGTCGCCGGT ATCCCGAACC AACCGAATGT CTTTTACATC
GGGGTGGTGA ATGGAGGGGT ATGGCGCACG AATGATTATG GCCGCACCTG GGACTCAATC
TTCGATAGCC AGCCAACGCA ATCGATCGGG GCAATCGCGG TGGCGCCGAG CGACCCGAAG
GTGATTTACG TTGCCAGCGG GGAAGGCCTT CATCGCCCCG ATTTATCGGT AGGCGATGGA
ATTTACAAAT CAACCGATGC TGGCAAAACT TGGACGCACC TCGGCCTGCG CGACGGTCAG
CAGATTCCCG CACTTGCCGT CGATCCTCGC GATCCGAACA AACTGCTCGT GGCCGTCGCC
GGACATCCCT ACGGCCCGAA CCCCGAGCGT GGCGTCTTCC GCTCCACCGA TGGCGGCCAG
ACTTTTCAGA AAGTCCTCTA CAAAGATGAA AATGTCGGCG CATCTGACGT GCAGATCGAT
CCGAGTAATC CCGACATCGT TTACGCAGGT CTTTGGGAGT CTCGCGAAGG GCCGTGGGAA
AACGGCCAAT GGAACGGCAC CGGCGGCGGC ATCTACAAAT CAACTGACGG CGGCCAGACT
TGGAACCAAC TTGCAGGTGG CCTGCCGGAT GGGATCATCC AAGTCTACGT TGCGATCTCG
CCAAGCTCTC CGAACCGCCT CTATGCTTCG GTCGCTACAA AAGCCGGAGT TCACATATAT
GGATCGAAAG ACGCAGGCAC AACATGGACA ACCGTGACCG ACGACGCGCG TCCCGAACAA
CGAATCGGCG GTGGTGATCT TCCGGTGCCA AAGGTTGATC CACGCGATCC TGAAACGCTT
TACATGACAA GCACCGTTAC ATGGAAATCT ACCGATAGTG GCAAGACTTG GATCGGTTTC
CGCGGCGCGC CCGGCGGCGA TGACTACCAG AATATCTGGA TCAATCCCAA CGACCCTAAG
ATCATCTGCA TCGTCAGCGA TCAGGGCGCA ATCGTCACGG TGAATGGCGG CGAGTCGTGG
AGCTCCTGGT ACAACCAGCC CACTGCGCAG ATGTACCACG TGAATACCGA CAACGCTTTC
CCGTACCGCG TTTGCAGCGG TCAGCAGGAG AGTGGCTCGG CGTGCGTCTC CAGTCGCGGC
GACGACGGCC AGATCACCTT TCGCGAGTGG CACCCCGTCG CAGCCGAAGA GTACGGCTAC
GCTGTCCCCG ATCCACTTGA TCCTGACATC GTAATTGGCG GCAAACTCAC TCGCTACGAC
CGGCGCACCG GTCAGGCACA GAACATCTCA CCACGACCTT TACGCGGTCC CGACTTCCGC
GTCGTCCGCA CGGAGCCGAT CGTCTTCGAT CCGAAGGACC CGCACATTCT CTACTTCGCT
GCAAATGCAC TTTGGAAGAC GACGGACTAC GGCAAGCACT GGACTCAAGC CAGCCCCGAC
CTCACGCGCA AAAATTTCTC GCTTCCGGCG AACATCGGAA AATTCAGCGA TCAGCCAACG
GCGAAGGCAA AGCAACGCGG CGTGATTTAC GCCGTAGCCA TCTCACCTCT CGACACCAAG
CGCATCTGGG CGGGCACCGA TGACGGCCTC CTGCACATCA CCGCCGATGG GGGCGCCCAC
TGGACCGACG TTACCGGCAA CACACTCACT CCGTTCGAGA AAGTTTCCGT GCTCGAAGCC
AGCCATTTCG ACGCACAAAC CGCTTACGCA GCCATCAACA CGCTGCGCCT CGATATTCTC
AAACCGAAGA TCCTGCGCAC GCACGATGGC GGAAAAACTT GGGCGAATGT TCGAGAAGGT
ATTCCCGACG GCGAAACTGT CAATGCTGTC CGCGAAGATC CCAAGCGCAA AGGTCTCCTC
TTCGCCGCTA CCGAAAAGTC TGTTTACGCT TCGTTTGATG ACGGCGACCA CTGGCAATCT
CTGCGTCTGA ATTTACCAGC CAGTTCCGTA CGTGATATTC AAATCCATGG CGACGATCTC
GTCGCTGGAA CTCACGGACG CGGCTTCTGG ATTCTCGACA ACATTTCCGC GCTGCGCGAG
ATTAAGCCGC AAGCATTCGA CCATCCAGTA CTCTTCCAGC CGCAGACGGC CATCCGCGTC
CGCTGGAATA TGAACACCGA CACTCCGCTG CCGCCCGATG AGCCACGCTT GCCCAATCCT
CCCGATGGCG CAATCATCGA CTACTTCTTG CCCGCAGGAT TCCATGGCGA AGTGAAGCTT
GAAATCCACG ACGCCGCCGG AAAGGTTCTT CGCACATTCT CCAGCAACGA TCCCGTGCCG
GCGGATGATC CGAAGCTGGC GATTCCCCGC TACTGGCCGC GCCCGCCACA ACCCCTTGAA
GGGACACCGG GCATGCATCG CTTCTTGTGG GACATGCACC TTGCTGCCAT CCCCGGGATT
CATGCAGAGT ACCCAATCGC TGCCGTGCCG CACGACACCG CCCCGGCGCC AACTTCGCCG
TGGGTGCAGC CTGGCCGCTA CATGGTGGTA CTGCTCGCAA ATGGATCGCG AGCCGACATG
CCGCTCACGA TCGAAATGGA CCCTCGGGTC AAAACAGCCA CTGCGGATCT CGCGCAGCAA
TTCAACGCAT CGCATCAGCT CTATGAAGAC GCGAAGTTGA TCAGCGACGC CGCGGCCCAC
GCGCAAGCGA TTCGCGAGCA ACTCGATCAA CTCCATTCCA AAGGCGGTGC AGCCGCCACT
GACATCGAAG CCTTCAACAA GAAGATCGCC GAGATCGCCG GCGAAGAAGA AGATTTCGGC
CCGCGTCGTC CTGGCACCGC TGAGACTCTA AGCAGCGTTC GGACTGGCGC TCTCTTCCTG
ATGACGATGA TGCAAGACGC TGACGCCGCA CCGACTGAGG CAATGCTTAC CAAAGCCACT
GAAATCCACA CGGCGACGCC AAAAGTCATC GAACGATGGA AGCAGTTCGT GCAGCAGGAG
CTACCGAAGT TCAATGATCG TCTGAAGCAA GACAACCTCA GCCCGTTGAA TCCGCAGGCC
AAGGTCCGCG ACGCAGAAGT CGAACTCCGT GGCAACGAAG AATAG
 
Protein sequence
MIRLRKHFSV LLFSLLSSLV IAQQLPDMSQ AWHWRPIGPL RGGRTRSVAG IPNQPNVFYI 
GVVNGGVWRT NDYGRTWDSI FDSQPTQSIG AIAVAPSDPK VIYVASGEGL HRPDLSVGDG
IYKSTDAGKT WTHLGLRDGQ QIPALAVDPR DPNKLLVAVA GHPYGPNPER GVFRSTDGGQ
TFQKVLYKDE NVGASDVQID PSNPDIVYAG LWESREGPWE NGQWNGTGGG IYKSTDGGQT
WNQLAGGLPD GIIQVYVAIS PSSPNRLYAS VATKAGVHIY GSKDAGTTWT TVTDDARPEQ
RIGGGDLPVP KVDPRDPETL YMTSTVTWKS TDSGKTWIGF RGAPGGDDYQ NIWINPNDPK
IICIVSDQGA IVTVNGGESW SSWYNQPTAQ MYHVNTDNAF PYRVCSGQQE SGSACVSSRG
DDGQITFREW HPVAAEEYGY AVPDPLDPDI VIGGKLTRYD RRTGQAQNIS PRPLRGPDFR
VVRTEPIVFD PKDPHILYFA ANALWKTTDY GKHWTQASPD LTRKNFSLPA NIGKFSDQPT
AKAKQRGVIY AVAISPLDTK RIWAGTDDGL LHITADGGAH WTDVTGNTLT PFEKVSVLEA
SHFDAQTAYA AINTLRLDIL KPKILRTHDG GKTWANVREG IPDGETVNAV REDPKRKGLL
FAATEKSVYA SFDDGDHWQS LRLNLPASSV RDIQIHGDDL VAGTHGRGFW ILDNISALRE
IKPQAFDHPV LFQPQTAIRV RWNMNTDTPL PPDEPRLPNP PDGAIIDYFL PAGFHGEVKL
EIHDAAGKVL RTFSSNDPVP ADDPKLAIPR YWPRPPQPLE GTPGMHRFLW DMHLAAIPGI
HAEYPIAAVP HDTAPAPTSP WVQPGRYMVV LLANGSRADM PLTIEMDPRV KTATADLAQQ
FNASHQLYED AKLISDAAAH AQAIREQLDQ LHSKGGAAAT DIEAFNKKIA EIAGEEEDFG
PRRPGTAETL SSVRTGALFL MTMMQDADAA PTEAMLTKAT EIHTATPKVI ERWKQFVQQE
LPKFNDRLKQ DNLSPLNPQA KVRDAEVELR GNEE