Gene Acid345_0655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0655 
Symbol 
ID4069747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp807027 
End bp809276 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content57% 
IMG OID637982661 
Producthypothetical protein 
Protein accessionYP_589734 
Protein GI94967686 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.792634 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCTT CTGATCATGA CGTCAGCCGT CGCACATTTC TAAAGTTCGG GGCCGCTGGA 
ACGACCTTGG CCGCTCTTAC TCCCACAGCG GTCGCGAATC CCGGGTTCTT CGGCACCGAT
CCAATCGTCA AAAAAAGGCT GACGTGCCCG CCATCGCTTG CCGACATGGC CTCCGATCCG
CAGCGCTATC AATTCCGAGA TCTTTTCAAT TCACCGGCGG CGATGAACGA GTTTGGGTAC
GCGCAGGTTG GCAAGTCGGT CTCGGCAATC ACCGCGATCT CATTTCCTCC GTACGCATGC
TGCGCTCCGC CGTCGATGCC CTGGAGCCCC GGCTATCTTC TCACTTGCGA ACTGTTCCTC
GACGGCCGGT TTGCCGCGAT TGCTCCGGAG CCGGAAGGCG TCGTGGAGTA CCAATGGTTC
CCGCATTGCG TGTTTCGCAA CCAGACCATG GGGGGACTCC AAATCTCAAC CCGTATGTTC
CTGCCCAGCA AGCAACGTGC GGTGATGCAA ACGATCACGG TAAAAAACGC CAGCAGTGGC
CGCAGAACGT TCACGCTTGG TTTCGACATG CGTGGTGCGG TCGCAAAGCA GACGACGCCA
TGGTTCGTGA ATTCTCCCGG CGAAGCCGAC AACAAGATCA CATATGACGC GCGACGCGGC
TGCCTCATCT TCGAAGCGCA GCACTCACAG ATCGTCGGTG TGCAGGGTTT TCATACCTCT
CCCGACCGGG TCGAGCAGAA GCGAATGCTG CTTTTTGACC TCGAGCTTGG TTCCAGGGAA
TCAAAATCTC TGAATTTTAC GGTCGCGCTT GCGGGCGACG CTTCGAGCGC GGTGGAACTC
TATGACAAGT TGCAGGCTAA CTTCGCAGGG ATCGAGAAGG AGAGTGAAGC GACCTTCGAC
CATCTGGTGG GTTCGGCGTT CACACCCGGC AATTCCGAAT TCAGCGGAAA CCTGCCGCGC
CTTGTCACCG ACAACGAAGC TCTCTGGAAG CTGTACCATA ACGGTTTCGC CAACCTACTG
TTTGCAAGGC GCGTATCGCC CGATTCCGTG TACGGTCCAA CCTATCTCAC GCTCAGTGGA
CACGTGCTGC CGACCCTGAG TTTTCCGTGG GACACTTCGC TCACCTCTCT TGCACTAGCA
TTGCTGGACC CGACGCCTCT GCGCACGCTT GTCGAGGCAT GGCTGAAACT CGGTTTGCAC
GACCATCACT CCAGCGACTA CATCAGCGGG CAAGGTGCAG GGCCGTGGTA TGCGGTGAAC
GATACCGCCA TCGTTCGCTG CGCTTGGCGA TACATCTGCG TTACCGGCGA TTTCGCGTGG
CTCGACAAGA AAATTGGCGC TCACTCTGTA CTCGAAGGCC TCGAAGAGCA CGCACTCTAC
TGGAAGAAGC TCGACCCTGC TGGTCACGGA CTCGGAGATT ACGGCACGAT TGAGAATCTT
CTGGAAGTTG TGAGCACGTA TCTCCACGAA GTTGCCGGTA TGAACGCGAA CAATGTACAC
AGCATGCGGG TTGTCGCAGC AATGCACGAG CATCGTGGAA ATCAGGTGCG CGCACAACAA
CTACGGGCGG AAGCGAAGTC GCTGGCCGAG CGCATCATCC ATGACCTTTA CGTCGCCGGA
AAAGGGTATT GGCGCTGTCG CCAGCCAGAT GGCTCGTTCA ACGAAGTCCG TCACTGCTAC
GACTTCCTTG CTGTCCTCGA CAATATGGCC GAGGACCTAT CGCCCGCTCA GAAGCAAGAG
ATGGCCGCAT TCTTCTGGAG GGAGTTGCGC AGCGAAACCT GGATGCGCGC ATTGTCTCAA
AGCGACGCCG ACGCCACTTG GAACATCCGT CCCGACCATA GCTGCCTCGG TGCGTACGGC
GCGTGGCCGG CTATGAGTGC GAAAGGTTTG TACAAGGCCG GCAGCTCACC GAAATTGTCG
GCGTGGCTGA AGCAGGTCGC AAAGGCAGGG AATCAAGGCC CGATCGGGCA AGGCCATTTC
ATCGAAGACG TTTTTCCGCC CGTGAATGGC GGCGCGAGAA AAGCTTCGGA AGATGCGCCA
TACATCGAAG ATTGGTGCTG CATTGCGGCG GGTTCGTTTA CCGAACTTGT AATCGATTCG
ATCTTCGGAG CGGAGCTGAC GATGAAAGAT GGTATCCGAG TGAACTCGCG TCTAGAGGAT
TTCGATCCCA ATGCGCGGCT CGAAGGCCTG CGTTATCAAG GCTCGCTCTA CAGCATCACG
AAGAATGGGG CGCAAAAACA AACCGGATAA
 
Protein sequence
MPSSDHDVSR RTFLKFGAAG TTLAALTPTA VANPGFFGTD PIVKKRLTCP PSLADMASDP 
QRYQFRDLFN SPAAMNEFGY AQVGKSVSAI TAISFPPYAC CAPPSMPWSP GYLLTCELFL
DGRFAAIAPE PEGVVEYQWF PHCVFRNQTM GGLQISTRMF LPSKQRAVMQ TITVKNASSG
RRTFTLGFDM RGAVAKQTTP WFVNSPGEAD NKITYDARRG CLIFEAQHSQ IVGVQGFHTS
PDRVEQKRML LFDLELGSRE SKSLNFTVAL AGDASSAVEL YDKLQANFAG IEKESEATFD
HLVGSAFTPG NSEFSGNLPR LVTDNEALWK LYHNGFANLL FARRVSPDSV YGPTYLTLSG
HVLPTLSFPW DTSLTSLALA LLDPTPLRTL VEAWLKLGLH DHHSSDYISG QGAGPWYAVN
DTAIVRCAWR YICVTGDFAW LDKKIGAHSV LEGLEEHALY WKKLDPAGHG LGDYGTIENL
LEVVSTYLHE VAGMNANNVH SMRVVAAMHE HRGNQVRAQQ LRAEAKSLAE RIIHDLYVAG
KGYWRCRQPD GSFNEVRHCY DFLAVLDNMA EDLSPAQKQE MAAFFWRELR SETWMRALSQ
SDADATWNIR PDHSCLGAYG AWPAMSAKGL YKAGSSPKLS AWLKQVAKAG NQGPIGQGHF
IEDVFPPVNG GARKASEDAP YIEDWCCIAA GSFTELVIDS IFGAELTMKD GIRVNSRLED
FDPNARLEGL RYQGSLYSIT KNGAQKQTG