Gene Acid345_2261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2261 
Symbol 
ID4073255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2681596 
End bp2683011 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content62% 
IMG OID637984277 
Producthypothetical protein 
Protein accessionYP_591336 
Protein GI94969288 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000157771 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAAT TTTTCCCCTC GCGTAGAATG AGTCGTCACG AAAGTGATAT CGCGCCAAGC 
GCGCCGGGGC GAAATCCCGT ACTGAGTACT GAGAGTCTGA AGGAGCTATT CGAAATGAAG
AAACCGCTAG TGACGATGCT GCTAATGATG GCGACCGTCG CCATCCAGCC GCTTGCACTA
CCCAGCCTTG CCGCCGGCCA AGCTGCCGCG CCCCAACAGA AGAAAGAAAT CAAAGACCCC
GCGGAATACA ACGCATATGT AAACGCGGTG CAGCAAGCGG ACCCGAAGGC GAAGGCAACC
GCGCTGCAGT CGTTCCTTCA GACTTATCCC AATAGCGTGA TGAAGACTGA CGCGATGGAA
CTGTTGATGG CCGCTTACCA GCAGGCCGGC GACCAGCAGA ACATGCTGCA GACCGCCCAG
CAGATCATCC AGGTCGAGCC CAACAACGTT CGCGCCCTCG CGCTGCTCGC ATACACCTAC
CGCATGATGG CGCTGCAGAC GGGAAACAAG GACAACGCGG CCCAGGCTGC CCAGTATGGT
CAGAAGGGCC TCACGGCGCT ACAAGTTATC CAGAAGCCCG CTGAAGTCAG CGATGCTGAC
TTCGAGAAGC TGAAGAAGGA AACGCAGATC ATCTTCGACG GCGCTGCCGG CTTTGGCGCG
TTGAACACGA AGGACTACGC AACCGCGCAG AAGGACTTTG AAGACGGCGT GAACCTCGCC
GGCGCCAACG CCTCCTTCCT CGATGTCTAC CAGCTCGCGC TCGCCGATCT CGAGGCGAAC
CCGGTGAACC CGAAGGGTCT CTGGTACATT GCGCACGCTG CCGCCACCGC CCCGAACGAC
CAGGCGAAGA AACAGCTCGG CGACTACGGC CGCAAAAAGT ACAACAAGTT TCACGGTTCT
GAGCAGGGCT GGCCGGAGTT GTTGACCGCC GCCGCTGCCT CGCCGACGCC GCCGCAAGGC
TTCACGGTTG CCCCCGCGCC GCCGCCGCCG AGCCCCGCCG AGCAGGCTGC CGACCTCGTG
AAGAGCAAGG AAGTGAAGGA CATGAGCTTC GCCGAGTGGC AGCTCGTCCT TTCGTCCGGC
AACCAGGATG CGGCTGACAA GGTGTGGAAC ACGATCCACG ACAAGCAGAT CCAGCTCGTC
GCGTTCGTAA TCAGCGCCTC GCGCACCAAG CTCGAACTCG CCGGCAGCAC CGACGACAAC
GACGCGCACA AGGCTGACAT CACCGTCACC ATGGCAACCC CGATTCCGGC AGCGAAGGTG
CCCAAGGAAG GCGCGACCGT GCAGTTCCAG GCCGCTCCCG ACACCTACAC GCCGAATCCG
TTCATGATGA ACATGAAGGA CGGCGAACTG CCCGGCGTAG CCGCTGCACC GGCCCACAAG
CCCGCAGGCG CCCACAAGAA GCCCGCCGCG CAGTAA
 
Protein sequence
MSQFFPSRRM SRHESDIAPS APGRNPVLST ESLKELFEMK KPLVTMLLMM ATVAIQPLAL 
PSLAAGQAAA PQQKKEIKDP AEYNAYVNAV QQADPKAKAT ALQSFLQTYP NSVMKTDAME
LLMAAYQQAG DQQNMLQTAQ QIIQVEPNNV RALALLAYTY RMMALQTGNK DNAAQAAQYG
QKGLTALQVI QKPAEVSDAD FEKLKKETQI IFDGAAGFGA LNTKDYATAQ KDFEDGVNLA
GANASFLDVY QLALADLEAN PVNPKGLWYI AHAAATAPND QAKKQLGDYG RKKYNKFHGS
EQGWPELLTA AAASPTPPQG FTVAPAPPPP SPAEQAADLV KSKEVKDMSF AEWQLVLSSG
NQDAADKVWN TIHDKQIQLV AFVISASRTK LELAGSTDDN DAHKADITVT MATPIPAAKV
PKEGATVQFQ AAPDTYTPNP FMMNMKDGEL PGVAAAPAHK PAGAHKKPAA Q