Gene Acid345_1910 details

Gene Information

Locus tag: Acid345_1910
Symbol:
ID: 4069388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2295324
End bp: 2297267
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 58%
IMG OID: 637983921
Product: hypothetical protein
Protein accession: YP_590985
Protein GI: 94968937
COG category:
COG ID:
TIGRFAM ID:
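
The length fields above are consistent with the coordinates, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2295324
END_BP = 2297267


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1944 647
```

Under these assumptions the computed values match the listed Gene Length (1944 bp) and Protein Length (647 aa).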


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.154238
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.259383
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAACA TGAGGAACAA TACGATTGTC CTCCTCGCGT CATGGGTCGT CTGGTGTGCG 
ATTGCGTTGG TGATGGCGAT GCCAGCAGGC GCGCAGGAAC GAGAGACGGT ACGGGACCGA
GACGACAAGC TGAAAGGGAC GGTATCGGAA TCGCAACTCG ATGCCGCGTT ACAGCGTGGA
TTGCCGGGAA GAGTGATGAC TCCTATGCCG GCGAATGCCA AAGCCACAGT TGAGAAGAGC
GGGGGACTGC GGGGAGCGCC CGCGCGAAGC GTGGAGATGG CGGACATCCC GCAGAAAACG
ACGACGAGTT CGATTCCGGG ACTTGCCGAT GGCGGGCGGC CATTGTCGGG ACTAACGGAA
GATGAGTACC AGGCGTTGAA GCGGCTGGTT GCGATGCGGG CAGCGCGCGG CGAAGTCGGT
ACCAAACCGT CAACCCTGGC GTTGCGGGGC GGCGTGAAGG CAATAGTGCG GCCGGTGCAA
ATGACGCAGA TAACGCCCAA TGCCGCGACC GCTCATGCCA GCTTCTTTTA TGGGCAGAAA
GAAAATGGAC TGGATCCGCC GGACATGGGC CTGGCGACGA GTGAAAACTT CGTCGTACAG
GTGGTGAATG CATCCATTGC CGTGTACGCC AAGAACGGCG TGCTGCAGGC GGGATTCCCA
AAGGATGCGG ACACATTCTT CGGGGTGGCG GCGGGCACGT TCATCTTCGA TCCGCGAGCG
TTCTACGACT GGGCGAACCA TCGGTTCTTC GTGCTGATGG ACACCTTTAC CAGTGGCACG
GCTGGCGGCA GCCTTTACTT TGCGATCAGC AAAACCCAGG ATCCGCGCGG GGGATGGTGG
ATTTACCATT TCACGCCTGC CGGTCTCGGC GCCGGAATTT GCGGTGACTT CCCAACCCTC
GGACATGACA CGACGAATTG GGGAACATAC GCAACCAAGG GCGGAATCTA CATCGGCGTC
AACCTGTGGA CTACGAGCAG CGACTGCGGC GGGGCCAGCT TCTCCAATAA CGCGGTTTAC
CTGTTGCCTA AGGACGCGTT GTACGATGGC GGGGGATACG GCTACTGGTA CTTCACAGGC
TTCAATCTTG GCGGGGTAAA CATGGACACC CTGCAGCCAG TCAACGTGAC GGACCGTGCA
GACAAGCCGA GTACGATCTT CCTGATGAAT ACGCAGAACT ATAACTTCGG CGGTGGACAT
TGCTCGAGCG CCTGCAATGA AATGGTGCTC TGGGGAATAT CGGGACCGAC AGCCGGAACG
GTTCCTGATC CACACAATCC ATTTGCGTTC CTGCAAGGTG GAAATGGACC GATCGTGACG
TCGAAAGTTC TGACCACATT GCAAAATTTC ACTTTCCCAG TGAATGCGGC GGAGCCAGGG
TGCACTACGT CCGGTTCTCC GTGCGTGGAT ACGGACTATG TATTCATTTC CGGAGCGGTG
AAATACCACG CTGGAGAGAT CTTTGGTTCG TTCAACACGG CTGTGGCAAG CAGTCCGGCA
ACCACGGGGC CGATCTGGTT CGACGTGCAT CCATCAACCG ACAACAATGG TCAGGTGACG
AATGCCGTAA TCCGCCAGGA GGATTGTTTC CTCTGCGGCG GCTGGGCCAA TAGCGGCTCC
GCCTACTACG CAAACCTGCA GCCAGACCAG GAAAACAACG TGGTCATGAT CTACAACTAC
TCCGACAATC TGACATATCC CAGGAGCCAG TTGACGTCAC GCCGAGTCAC CTACGGCGAC
TCACTCATGG ATGGTGCAGG CAGCTATTTC AACACCGGGG GCACGGGATT CATGTCGGGC
AGGTGGGGAG ACTACACGGC CACCGCGCCC GATTTCACGA TTGCGAATCG CGGGCTGATT
TGGATGTCTG GCGAATATGC GACGCCGAGC GGTTGGGCGA CCATCATTGG CTGGACACAG
TATCTCAACT CGGATGACCA GTAG
 
Protein sequence
MRNMRNNTIV LLASWVVWCA IALVMAMPAG AQERETVRDR DDKLKGTVSE SQLDAALQRG 
LPGRVMTPMP ANAKATVEKS GGLRGAPARS VEMADIPQKT TTSSIPGLAD GGRPLSGLTE
DEYQALKRLV AMRAARGEVG TKPSTLALRG GVKAIVRPVQ MTQITPNAAT AHASFFYGQK
ENGLDPPDMG LATSENFVVQ VVNASIAVYA KNGVLQAGFP KDADTFFGVA AGTFIFDPRA
FYDWANHRFF VLMDTFTSGT AGGSLYFAIS KTQDPRGGWW IYHFTPAGLG AGICGDFPTL
GHDTTNWGTY ATKGGIYIGV NLWTTSSDCG GASFSNNAVY LLPKDALYDG GGYGYWYFTG
FNLGGVNMDT LQPVNVTDRA DKPSTIFLMN TQNYNFGGGH CSSACNEMVL WGISGPTAGT
VPDPHNPFAF LQGGNGPIVT SKVLTTLQNF TFPVNAAEPG CTTSGSPCVD TDYVFISGAV
KYHAGEIFGS FNTAVASSPA TTGPIWFDVH PSTDNNGQVT NAVIRQEDCF LCGGWANSGS
AYYANLQPDQ ENNVVMIYNY SDNLTYPRSQ LTSRRVTYGD SLMDGAGSYF NTGGTGFMSG
RWGDYTATAP DFTIANRGLI WMSGEYATPS GWATIIGWTQ YLNSDDQ
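
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It assumes the sense codons of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and the helper names `translate` and `gc_fraction` are hypothetical. Only the first line of the gene sequence is used as input here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = "ATGAGGAACA TGAGGAACAA TACGATTGTC CTCCTCGCGT CATGGGTCGT CTGGTGTGCG"
print(translate(first_line))  # prints: MRNMRNNTIVLLASWVVWCA
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.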