Gene Acid345_4386 details


Gene Information

Locus tag: Acid345_4386
Symbol:
ID: 4073292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5199273
End bp: 5200757
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 60%
IMG OID: 637986419
Product: hypothetical protein
Protein accession: YP_593460
Protein GI: 94971412
COG category: [S] Function unknown
COG ID: [COG1649] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
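
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The helper names (span_length, expected_protein_length) are hypothetical and not part of any database API.

```python
def span_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residue count for a complete CDS, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length = span_length(5199273, 5200757)
print(length)                           # 1485
print(expected_protein_length(length))  # 494
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.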


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAGT TCGACGTGAA TCGCCGCGAG TTCGTGAAGC TCGCGGCTGG CACCGGCGCT 
GCCCTCGCTC TGCCAAATTT CGCACTTGGC GATGTCTCTT CCAAAATGAT CGGCATCCAG
GTGGGCGCAG TTTCGTTTGT GGACGAAGGC ACGGAGAAAG TCCTGGATGT ACTCCAGGAG
CGCGCCGGCG TGAATACCCT GTTTCTTGCG GTATTCACCT ACGGCCGCGG CATTGCCGGC
CGGCAGATTC ACGGACAGCC GCTGCCCGAT CACGGCAAGC AGGAGTACGA TCTCAACTTT
CACGGCGGGA ACTTCGCCAC GCCTCATCCT GAGTTCTACA AGAACACGGT GATCAAGGAA
ACGCGCGCGC CCGACCACGG CAACCTCGAC ATCCTGGCGG AAGTTCTCCC CGCCGCGAAG
AAGCGTGGCA TGAAAGTCAT CTGCTGGCTG GAGGACGTGT TTCGCACCGA CCTGCCCAAC
ATCGCGAAGG TCCAGGAACG CGACCTCCAC GGGCGGAACA CCGAGACGCT CTGCGTCAAC
AATCCCGACT ATCGAAATTT CTTCACCGGC CTGGTCGAAG ACTACGCGCG GTCCTACGAC
ATTGACGGAA TCATGTGGGG CTCGGAACGG CAGGGCGCGC TTTCCGACAG TCTGGGCGCC
ACCCACGACA CGCCGCCAAT CGATCCCGGA GAGGTGACCT GCTTCTGCGA GTTCTGTCAG
GCCAAGGCGA AGCAGCGCGG CATCAACCCG GAGCGCGCAC GGCAGGGATT TCTTGAGTTG
GAAAAATTCG TTCGCGCGTC TCGCGGCGGC AAGCGCCCCG TGGACGGATA CTACGTTCAA
TTCTGGCGGA TCCTGTTGCG CTATCCCGAA CTGGCCGCCT GGGAAATGCT CTTCACGGAT
GGTCTTCGCG AAAACTACGC CGCCATCTAC AAGACCGTCA AAGCAGCGAA ACCGGCGGTG
CCGGTGGGTT GGCACATCTG GCACAACAAC TCGTTCAACC CCATCTATCG CGCGGAACAA
GACCTGCAGG AGCTCGCGAA GTATTCCGAT TTCATAAAGG TCGTGATGTA CAACAACTGC
GGCGGCGAGC GAATGGCGCT CTACGCGGAC AATATCGGCT CGACACTCTA TGGCGATCTC
TCGAAACAAG GCCTGCTCGA TATGAACTAT GCGCTGATGG GACTCAAGGA GGGCAGCTAC
GAGCAGATCC CTCGCACCGG ACTTTCCGCC GACTACGTCT TCCGCGAAAC GAAACGCGCG
CTGGAAGGAG TTGCCGGCAC CAACACCCGG ATCTGGCCCG GCATCGATAT CGATATCCCG
ACTGAGCCCG AGAACAGCAA GTGCACTCCG CAAAGCGTAA AGGCCGCAGT TCTCGCGGCA
CTCCGCGCCG GCGCCTCCGG CGTGCTCCTA TCGCGCAAGT ATTCTGAGAT GCGCCTCGCG
AACCTCAGCG GCGCCGGGGA CGCGATTCGC GAGTTCAAAG TGTAG
 
Protein sequence
MLKFDVNRRE FVKLAAGTGA ALALPNFALG DVSSKMIGIQ VGAVSFVDEG TEKVLDVLQE 
RAGVNTLFLA VFTYGRGIAG RQIHGQPLPD HGKQEYDLNF HGGNFATPHP EFYKNTVIKE
TRAPDHGNLD ILAEVLPAAK KRGMKVICWL EDVFRTDLPN IAKVQERDLH GRNTETLCVN
NPDYRNFFTG LVEDYARSYD IDGIMWGSER QGALSDSLGA THDTPPIDPG EVTCFCEFCQ
AKAKQRGINP ERARQGFLEL EKFVRASRGG KRPVDGYYVQ FWRILLRYPE LAAWEMLFTD
GLRENYAAIY KTVKAAKPAV PVGWHIWHNN SFNPIYRAEQ DLQELAKYSD FIKVVMYNNC
GGERMALYAD NIGSTLYGDL SKQGLLDMNY ALMGLKEGSY EQIPRTGLSA DYVFRETKRA
LEGVAGTNTR IWPGIDIDIP TEPENSKCTP QSVKAAVLAA LRAGASGVLL SRKYSEMRLA
NLSGAGDAIR EFKV
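
The protein sequence follows from the gene sequence through translation table 11, and the GC content field can be recomputed from the same nucleotides. The sketch below is a minimal, dependency-free illustration and not the tool used to build this record. The function names (clean, gc_content, translate) are hypothetical. It relies on the amino acid assignments of NCBI table 11, which match the standard code, and it does not handle the alternative start codons of table 11 (not needed here, since this gene starts with ATG).

```python
# NCBI codon ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
# Amino acid assignments shared by NCBI translation tables 1 and 11 ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing partial codon is ignored.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above, with the display spacing left in.
print(translate("ATGTTGAAGT TCGACGTGAA TCGCCGCGAG"))  # MLKFDVNRRE
```

Passing the full gene sequence to translate and gc_content in the same way gives values that can be compared with the protein sequence and the GC content field listed in this record.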