Gene Acid345_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4388 
Symbol 
ID4073294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5202294 
End bp5203865 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content59% 
IMG OID637986421 
Productypothetical protein 
Protein accessionYP_593462 
Protein GI94971414 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGAC GCAGGTTCTT GCAGTCCACC GCTGCCACGG GATTGACCCT AGGTTTGTTG 
AAGGCCACCA GCCAAGCCTC CGTACCAGAA CACAACTGGG ACAAATACGA TTGGGGAGCG
GGGCCTGCGG TTCCCGACCG GCTTTATCAA GGGCCATTTC CCCAGTACGG TCCATGCGCG
GTAGTTCCCG AGAGCGATGT CTCGATGATC ACTTCGTCGT CGCGCGACAT CGTCAGCAAT
TACGGCATGG GTCTGATGGT TTATGTCTCC GACGACACTG GCCCGATCAA CGTCCCCGGC
CAGCCGCAGG CGCAAGCTCT CGAAGACCTG ATCAAGCTGC CGTTCGCGCA GAAGATCTAC
ATCCGTCCCA ACTGGCGTGA GATCCAGAAA CAACCGGGCC GTCTCGACTT TCCGGAATAC
TGGAAGGAGA CCTTCGACCT CGCCCGCCGT TACGACAAGC GCATCGGCTT CCGCATTCAA
CTGGAAAACC CCGACTCGCC GGAACCGGGC ATGCCCGACT TTCTGATTAA CAAGGTTCCC
TACGTAAAGC TCAAAGGTGA GTGGAAGGGG AGTTCCGGAG AGCAGCGCTA CAAGAAAGAT
AACCGCGTGC CGCGCTACGA CCATCCGGCG TACCAGGCGG CGTTTCGCGA ATTGAACGAA
CTGCTGGCCG CCGAGTACAA CGGTCATCCG CAGGTCGAGT TCATGGACAC CATGATGTAC
GGCTTCTGGG GCGAGGGCCA CACCTGGCCG TACGAGGGCA ATCCTTTCCC GAGTGCGCTG
GTGGCGGAGC AAACGTGGAT GCAGATGCTC GAACTGCAAC TTCAGCTCTG GACAAAAGTT
CCGCTGGCGA CCAACACCCA GCCCGACTTC AGCAATGTGG GCAATGCCGA CATGCTCGAC
CGCACCGTCC GTACCGGCAA CTGGCTGCGC ACCGATACGA TCTTTATTGA GAACACGCAG
ATCGAGGCGT TGAGTTACCG TCCGCCGTGG ACGGCAGCCA TCTGCGAAGT TGGTTTCACC
ACTGGCGATC CCAAAGAACT CCAGATCGAC CAGGACGGAA TCACCTACAA CGAGCAGATC
ATTACCCACG CGGCCGACGT CGGCGTGAAC TACCTGTCGC TCTGGAACTG GCACAAGCTC
TCGGCCCACA ATCTCTTGAG CTACTACGAG AAATATCCGG CGCCGATTGA CGAGATGGCC
CGCAAGATCG GCTACCGCAT TCGACCGTCA TTTATCTGGA CGTTCGTTCG CGACGGAGCG
CATGGGCTGG TGGTGGGCTT GGCGAACGAC GGAATCGCTC CGGTTCCGGG CGTGCTGCGG
CTCACGGTGT TGAGCGAAGA TGGCAAGGTG CATTTCTCCG GCTGCGTCGA CGCCGGCTAT
CCGAAGCCGA TCGGCATTCA TCAGGCCATG CTCCAACTGC CTGCGGGCGT TGACTGGAAA
GGCCTGCGAC TGAAAGCCGA ACTCGAAGTG AAGGGCGTTC GTTATCCGGT CCGATGGGCG
TGCCGGCAGG TCAATCCCGA TGGCTCATTG ACGCTGCGCC ACAACTACAA ACCCGATCAG
CCGTTGGTGT AG
 
Protein sequence
MDRRRFLQST AATGLTLGLL KATSQASVPE HNWDKYDWGA GPAVPDRLYQ GPFPQYGPCA 
VVPESDVSMI TSSSRDIVSN YGMGLMVYVS DDTGPINVPG QPQAQALEDL IKLPFAQKIY
IRPNWREIQK QPGRLDFPEY WKETFDLARR YDKRIGFRIQ LENPDSPEPG MPDFLINKVP
YVKLKGEWKG SSGEQRYKKD NRVPRYDHPA YQAAFRELNE LLAAEYNGHP QVEFMDTMMY
GFWGEGHTWP YEGNPFPSAL VAEQTWMQML ELQLQLWTKV PLATNTQPDF SNVGNADMLD
RTVRTGNWLR TDTIFIENTQ IEALSYRPPW TAAICEVGFT TGDPKELQID QDGITYNEQI
ITHAADVGVN YLSLWNWHKL SAHNLLSYYE KYPAPIDEMA RKIGYRIRPS FIWTFVRDGA
HGLVVGLAND GIAPVPGVLR LTVLSEDGKV HFSGCVDAGY PKPIGIHQAM LQLPAGVDWK
GLRLKAELEV KGVRYPVRWA CRQVNPDGSL TLRHNYKPDQ PLV