Gene Acid345_2589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2589 
Symbol 
ID4070552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3058112 
End bp3059302 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content59% 
IMG OID637984606 
Producthypothetical protein 
Protein accessionYP_591664 
Protein GI94969616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTG CCCGTCTCCC GGCCCCGTCA GTCACGGAAT TCGATTCGCC AATCCTCGCC 
CTTCACCATT GGTTACGTGA GCGAAACTAT GCCGGACACG AACCCTATGA TCTGCTGAAT
TCGCCGCTGC TTCGCAAGTG GGCTGTGCAT CAACCTTTCG CCACTCTCTT CATTCAGGGC
GGCAAACGGA TCGGCGGCGT TCACCTCCGC CAGTGGCTCC ACGTTCCACC CAGTCATAAT
CCCAAAGCTC TCGCACTAGT ATTGAGCGCA TTCTGCGATC TCGCGCGCTC GGGTTGGTTC
TCCCGTCGCC ACGCGGAACA TGTCCGGAAC TTGCTGCTTG AACTCCGCAG TCCGCACGAA
TCCGACTTCT GCTGGGGATA CGACTGGCAT TACGTTTCAT TGCGCGGCGC TCGCATGCCG
GCGTTCTCGC CGAACTCCGT CGTCACCGTC TTCTGCGCCC ACGCTCTCCT CGACTTCGCC
AACATCTACC AGGACGAAGA ATCAAAAGCG ATCGCACATT CCGCGACAAA CTGGCTCGCA
ACCCGATTGA ATCGTTCTAC CGACACCGAT ACTGGCCTCT GCCTCAGCTA CACGCCCAAC
GACCATACCC GGATTTTCAA CAACAGCGCG CTCGCAGGTG CGTTGTTCGC GAGGATCGCG
AGCGACTCAC GACTGCCCCA GTACGGAAGT CTGGCTCGCC GTATCATGGA ATACCTAGGC
AACGGCCAGG CGAAAGACGG ATCCTGGACC TACGGCGTCG CGCGCTCACA ACAGTGGATT
GACACCTTCC ACACCGGATA CAACCTTTGT GCGCTGCTCG AATACCAGCA ACTCACCGGC
GATACCAGCT TTTCGCAAGC CCTCGCCCGC GGTTATGACT TTTATTGTTC CCACTTCTTC
TGTCCGGACG GCGCGCCGCG CTACTTCCAT AACCGCACTT ACCCAATTGA TATCCATTCC
TGCTCGCAGG CGATCCTGAC CCTCTGTGCC TTCGCTGAGC TTGACCCCGA TGCCCTCTCA
CGCGCCGAGC AAATCGCGCG CTGGACCATC CAGCACCTCC GCAACTCCGA CGGCTCTTTC
GGCTACCAGA TTCATCCTCA TCGGGTTGAC CGCACTCCTT ACATCCGCTG GTCGCAAGCC
TGGATGCTTC GCGCGCTCGC CCGCCTGCGC CTGACAATCG GAGGCGAATA A
 
Protein sequence
MNAARLPAPS VTEFDSPILA LHHWLRERNY AGHEPYDLLN SPLLRKWAVH QPFATLFIQG 
GKRIGGVHLR QWLHVPPSHN PKALALVLSA FCDLARSGWF SRRHAEHVRN LLLELRSPHE
SDFCWGYDWH YVSLRGARMP AFSPNSVVTV FCAHALLDFA NIYQDEESKA IAHSATNWLA
TRLNRSTDTD TGLCLSYTPN DHTRIFNNSA LAGALFARIA SDSRLPQYGS LARRIMEYLG
NGQAKDGSWT YGVARSQQWI DTFHTGYNLC ALLEYQQLTG DTSFSQALAR GYDFYCSHFF
CPDGAPRYFH NRTYPIDIHS CSQAILTLCA FAELDPDALS RAEQIARWTI QHLRNSDGSF
GYQIHPHRVD RTPYIRWSQA WMLRALARLR LTIGGE