Gene Acid345_0532 details

Gene Information

Locus tag: Acid345_0532
Symbol:
ID: 4069952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 653386
End bp: 656871
Gene Length: 3486 bp
Protein Length: 1161 aa
Translation table: 11
GC content: 57%
IMG OID: 637982537
Product: hypothetical protein
Protein accession: YP_589611
Protein GI: 94967563
COG category:
COG ID:
TIGRFAM ID:
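
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. The function names are hypothetical and not part of any database API. It assumes inclusive 1-based coordinates and that the gene length includes one stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
START_BP = 653386
END_BP = 656871


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 3486, matching "Gene Length"
print(protein_length_aa(length))  # 1161, matching "Protein Length"
```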


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.909749
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.440813
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCGCC GTTTCACAGA AATCACCCTA TTCCTGTCCG TATTGTGCCT GCTGTTGGGA 
ACAGCGTATG CACAGTACGG CGCGAGCCTG GAAGGGACTG TCACCGACAA GTCGGGTGCA
GTCGTCCCGG GCGCTAATGT CACGATTACT GACCAGGCCA CATCGGTTTC GAGGAACACG
GTCACCTCGG GATCTGGCTT TTATCGTGTG ACCGCAATGC CGCCGGGCAT GTACACGGTG
AGCGTGGAAG CAAGCTCTTT CGGAAAATCT GAAACGAAGA ACGTGGACGT ACAGGCCGAG
AAGGTGCGCG GCCTCAACAT CACGGTGTCT CCCGCGGCCA CGCAGCAATC GGTGGACGTG
AGCACGGAAG CAGCGGGTCT GGAAACAGAA TCGGCGAACG TGGATGGAAC CATTACCACG
AAGCAGGTAG ACCGGCTGCC AACCTATGGA CGCGATCCGT ATTCGCTCCT GCGACTGGCG
CCAGGCGTGT TCGGCGATGC TTCTCTCGCC GGGAATGGCA CTGCGAACTG GTTTCCGAAC
TCACCAGGAC CGGGACAGAA CGACCAGCCT GGAATTTTCC AAAACGAAAA CTTTGTGCAA
GCCACGGCCA ATGGCCAGCG CGCCAGCGGC AACAACTTCA TGATTGACGG TACTAGCGTC
AACAGCCTGA CGTGGGGCGG CGCCGCGATC ATCACGCCGA ACCAGGAATC GATCCAGGAA
ATTACCGTCG TCTCGTCGAC GTATTCGGCG GAAGACGGCC GCAACTCCGG CGCGCAGGTG
AAAGTCGTGT CGAAGAGCGG CACCAACAAC TTCCACGGCA GCGGTTTCTT CAAATACGAC
GAGCCGGGGC TGAATGCGCA GAACCAATGG GGCGGACCGA CGCCGGGAAC TCCAACGGAA
AAAGTTAACG TAAAGGCGCG CGATTTCGGT GGCAGTATCG GTGGCCCGAT TGTGAAGAAC
AAGTTGTTCT TCTTCTTCTC TTATGAAGGC GGCCGCTATT CGAACACCAA CTACTCGACA
CAATGGGTCG AGACCCCTGA ATTCGACCAG TTACTCGCCG CGACGTATCC GAACTCCCTG
ATCGGGCAAG CGGTGACGTT GCCGGGGAAT GCTCCACGCA TCCTGAACGT TATTCCGCAG
ACGGATTGCA GCGCAGTTCT TGGGCAGGGC TACACCGCAC CTGGATGCCA CGTTATCAAC
GGGCGCCTTG ACGTGGGTTC GCCGATCGGC ACCTACGGCG ACTATGCTCC GGTATTCACG
GCCGGCAGCT TGGGAAGTGG GTTCGACGGC GTTCCAGACC TGGAAGAGGC GTTCATTGCG
CTGCCGGGCA CCAGCAAGGG AAACCAATAC AACCTGCGCG TGGATTACAA CCTGGGCAGC
AAGGACCTGC TGGCGTTCAG CGGATACATG GTTCCGCGTA ACGACGTGGT CGCAACAAAC
AGCGGGCGTC CGAATGAAGA CCTGACGTTC GAGCCGCGCA ATAAGTACGG CGCGATTTTG
TGGAACCACA CCTTCTCGGC AACGTTGTTG AACGAATTCC GCGTAAATGC CACGCGGTTC
TTCACCAACC AGTACGACAC CAACAAGAAT GCGTATTGGG GCATTCCTTA CCTGCAGATC
GAACAGATTC CGAACAACCG CATCAATTAC GGCGCGACGC AGGGCACGAA TGCTCCGCTG
ATGGCGGCGC AGAACCAATT TGAATTCCGT GACAATCTGA GCAAGAACAT GGGACGTCAC
GCGTTCAAGA TGGGCGGCTC GTTCGCGATG AACCAGGACA ATAACGACTA CGAATTCGGC
AGCCAGCGTC CGATCTTCGT TTATCACGAG CTCTGGAACT TCTTTAACGG CGCTCCGATT
TACGAAGGCA TAGACGCTGA TCCGCGCAGC GGCCAACCGA CGGATGTACA TAAGTACTTC
CGCCAGAACG ACTGGGCGCT GTACTTCCAG GACGATTGGA AGATCACGCC GAAATTGACG
TTGAACATCG GCATTCGCTA CGAGTACTTC GCACCGCTGA CTGAGAAGTA TGGACGGCTG
AGCAATCTGT ATCTCGGCGC GGCCGGCCCG GACGCGTTGA CGACTGCGAC TGTCAAAACA
ACGGATCAAC TCTACCCACC AGACAGGAAC AATTTTGCGC CTCGTCTCGG CTTTGCGTGG
AGTCCGTTCT CCGACGCGAA AACTGTGATT CGCGGCGGTG TGGGTGTTTC CTATAACCGC
ATTACTGACA CAATGACCGG CATTTCGCGC GTCAATCCGC CGTATCTGTT CCGCTATGGC
ATCTGCTGCG GAACCGAGAC CGGCGCGTTC GGAACGCCTT ACGTCGGCGG ACAGATTGAC
CCTAACGTTG TGGGCACGAA CTATCAATCG ATGTACAACT ACGGCCCGAA TCCCGTGCTA
ACGAACAACT TCGATCCCAC CACCGGATTG CCAATTACTG GATCGGTAGA AGTCTGGGGC
GCGCCGCAGA ACATGGCAAC GCCGTACATC TGGAATTACT CACTGGATGT GCAGCAGGAG
TTGCCTTGGA ATATGGTGCT CGATATTGGC TATGCCGGCA GCGAAACCCG CAAGCTGCTG
CGCATCGTCA ACCTCGATTA CATCTACAGC AACGCCGGCT CGGATACGGC TTCTCATGCG
AACCCGGTGT ACTTCCCATC GACGAGCGCA AACGGCAACT ACAGTGCTTT GTTGGTTAAC
CTGAATCGCC GCTTCTCAAA TGGACTCCAG TTCATCGGGA AGTATCGCTT CAGCAAGAGC
ATGGATACGG TGAGCGGAGA AGGCGCGGGC TTCGAGACCA ACCAGTTCTG GCCTCTGAAC
CAGACCTGGG ACTACGGTCC GTCCGACTTC GACTCGACGC ACAACATCCT GTTCACGGCG
TTGTGGGATC TGCCGATTTA CCGGAACCGT CATGATTGGG TTGGTACCCT TCTCGGCGGC
TGGCACGTCG ATGGAACCTA CCAATACCAC TCCGGCTTCC CGTGGAGCCC GGTGCAGTCG
AATGACTGCC CGACAATTCC ATTGATCGGA TCGGCTCCGT GTCCGGCGCT CGTAACCGCT
CAGTACATGA ACGGCAAATC GGGTTTGGCG AACGATGACT TCCTGCATGG CGGCATCTTC
CCGAACGCGC TCGTAACCGT AACTTGCCCC GGCGGCGCAA CGGCTACGAT CAACCAGTAC
TTCCAGCCGG CGGACTGCTC GCAGCCTCCG GCGGGACCGC CGTTCATTCA CCGCAACTCG
TTCCGTGGAC CGAATTACTC CACGGTTGAC CTTGCTATCG GTAAAACCGC ACGCCTGCCC
TGGTTTGGTG GCGAAAGTTC GCAAATCGAC TTCCGCGCCA ACCTCTACAA CGCCTTCAAC
CGGCTCAACT TCGAACCGTT TGGCTATTCC TCGTCGGCAA CGTCGATCAA TAGCAATACG
TTCGGGCAGC CGCAGGCGGC TTTGGCAGGA CGCGTAATCG ACTTCCAGGT GCGCTTCACC
TTCTAA
 
Protein sequence
MSRRFTEITL FLSVLCLLLG TAYAQYGASL EGTVTDKSGA VVPGANVTIT DQATSVSRNT 
VTSGSGFYRV TAMPPGMYTV SVEASSFGKS ETKNVDVQAE KVRGLNITVS PAATQQSVDV
STEAAGLETE SANVDGTITT KQVDRLPTYG RDPYSLLRLA PGVFGDASLA GNGTANWFPN
SPGPGQNDQP GIFQNENFVQ ATANGQRASG NNFMIDGTSV NSLTWGGAAI ITPNQESIQE
ITVVSSTYSA EDGRNSGAQV KVVSKSGTNN FHGSGFFKYD EPGLNAQNQW GGPTPGTPTE
KVNVKARDFG GSIGGPIVKN KLFFFFSYEG GRYSNTNYST QWVETPEFDQ LLAATYPNSL
IGQAVTLPGN APRILNVIPQ TDCSAVLGQG YTAPGCHVIN GRLDVGSPIG TYGDYAPVFT
AGSLGSGFDG VPDLEEAFIA LPGTSKGNQY NLRVDYNLGS KDLLAFSGYM VPRNDVVATN
SGRPNEDLTF EPRNKYGAIL WNHTFSATLL NEFRVNATRF FTNQYDTNKN AYWGIPYLQI
EQIPNNRINY GATQGTNAPL MAAQNQFEFR DNLSKNMGRH AFKMGGSFAM NQDNNDYEFG
SQRPIFVYHE LWNFFNGAPI YEGIDADPRS GQPTDVHKYF RQNDWALYFQ DDWKITPKLT
LNIGIRYEYF APLTEKYGRL SNLYLGAAGP DALTTATVKT TDQLYPPDRN NFAPRLGFAW
SPFSDAKTVI RGGVGVSYNR ITDTMTGISR VNPPYLFRYG ICCGTETGAF GTPYVGGQID
PNVVGTNYQS MYNYGPNPVL TNNFDPTTGL PITGSVEVWG APQNMATPYI WNYSLDVQQE
LPWNMVLDIG YAGSETRKLL RIVNLDYIYS NAGSDTASHA NPVYFPSTSA NGNYSALLVN
LNRRFSNGLQ FIGKYRFSKS MDTVSGEGAG FETNQFWPLN QTWDYGPSDF DSTHNILFTA
LWDLPIYRNR HDWVGTLLGG WHVDGTYQYH SGFPWSPVQS NDCPTIPLIG SAPCPALVTA
QYMNGKSGLA NDDFLHGGIF PNALVTVTCP GGATATINQY FQPADCSQPP AGPPFIHRNS
FRGPNYSTVD LAIGKTARLP WFGGESSQID FRANLYNAFN RLNFEPFGYS SSATSINSNT
FGQPQAALAG RVIDFQVRFT F
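
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence for comparison with the listed protein. All helper names are hypothetical. It assumes an unambiguous A/C/G/T sequence and relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any incomplete trailing codon
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.split("*")[0]


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGTCTCGCC GTTTCACAGA AATCACCCTA"
print(translate(first_blocks))  # MSRRFTEITL, the first block of the protein
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would be a natural consistency check for the whole record.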