Gene Acid345_0320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0320 
Symbol 
ID4068597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp346156 
End bp347979 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content57% 
IMG OID637982323 
Producthypothetical protein 
Protein accessionYP_589399 
Protein GI94967351 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.280201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.603461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGATC GTCGGGATTT TCTAAAGACT GCTGGAATTG CGGCAGCCGG CTCCTATGCG 
AGCCGGCTGT CCGCTGCCCA AGCAGGACCA TCGCAAGCGC ATCGGTTCAC GACCTTCAAC
TACGGGGATG TTCAGCTACT CGATGGTCCG CTGAAGAAAC AGTTTGACGA GAACCATGCG
TTCTTTCTCA AGCTCGATGA AGACCGCCTG CTGAAAGTCT TCCGCCAGAA AGCCGGACTG
CCCGCGCCCG GCGAGGACAT GGGTGGCTGG TACGACCTCA CGGGCTTCGA TCTCGCCAAG
GGCGACTTCC ACGGCTTCGT TCCCGGGCAC ACCTTGGGCC AATATGTTTC GGCGTTGGCC
CGATGCTATG CCGCCACAGG ATCGGAAGAG ACCAAGGCGA AGGTTCATCG ACTGGTGAAG
GGTTACGGCG CCACGCTCGA CGACAAAGCT TCATTCTTCG CCGGCTATCG CCTACCGGCC
TACACCTACG ACAAGCTCTC GTGCGGGCTA ATCGACGCGC ACGAGTTCGC ACACGATCCC
GACGCGATGG CGATTCACGA AAAGCTGACG CGCGGCATGT TGCAATATCT TCCTGAAAAA
GCATTGTCGC GAGCGGAGCA GCGGGCACGG CCCCACAAAG ATGAGTCGTT CACGTGGGAC
GAGAGCTACA CGCTGCCGGA GAATCTGTTC CTCGCCTATC GCCGGACGGG CAACAAGTTC
TATCGCGAGC TCGGAACTCG TTTCCTGGAA GACGATACCT ATTTCAATCC GCTCTCGGAG
GGTATCAACG TGCTCGCGGG TGAGCACGCC TATAGCCACA TGAATGCCTT CTGTTCGGCG
ATGCAGGCCT ACCTCACGCT CGACAGCGAA CGGCACCGCA AAGCGGCGCG GAATGGCTTT
CGCATGGTCG CCGAACAAAG CTTTGCCACT GGCGGATGGG GACCGAGCGA GGCATTTGTA
GAGTTCAACA AAGGCCAGCT TGGCGACAGC CTGGAAAAGT CGCACTCGAG CTTCGAGACT
CCTTGCGGCG CGTACGCACA TTTCAAACTG ACACGATATC TTCTCCAAAC CGACGGCGAC
TCCACTTACG GCGACAGCAT GGAGCGCGTG ATGTACAACA CTGTGCTCGG CGCCAAACCG
ATCCAGCCGG ACGGAACTAG CTTCTATTAT TCGGATTACG CTACCGTTGG CAAAAAGGTC
TATCACAACG ACAAGTGGCC GTGCTGCTCA GGCACGCTTC CGCAAGTCGC AGCGGATTAC
CACATCAGCA TCTATCTCAA AGCGACAGAC GGCGTTTGTG TAAATCTATT TGTTCCTTCG
ACGCTCATCT GGAAGGCCAG CGATGGGAGT TGCAAGCTCA CGCAGGAGAC GAAGTACCCG
TTCGAGACTT CCGTCGCAAT GCGATTCGCT ACCACGCAGC CTGTCGAACA AACTCTGTAC
ATCAGGATTC CGGCGTGGGT CACCAGCGAA CCTGCACTTC GCGTGAATGG CCAACGCACA
GACGTTGCGG CGAAACCGGG AGCATTCGCG GCAATCCGGC GTACTTGGAA GGACGGCGAT
CGCATTGATC TCGACCTGCC TATGGGCTTC GAGTTGCAAC CCGTCGATGG GCAACACGAG
AAACTCGTAG CCCTTGTTCA CGGCCCCTTA GTGTTGTTCG CAATCGGCGA TTCGCGGCCG
CGTTTTCATC GCTCCGACCT GCTCGACGCC AAGCCATCCG CCAACAACGA CTGGAGCGTT
CGCGCCGCCG GTGGCAAGCA AGTTGTTTTT AGATCGTTCC TAAAGATTCA GGATGAGAGT
TACAGCACGT ACGTTGAGAT CTAG
 
Protein sequence
MLDRRDFLKT AGIAAAGSYA SRLSAAQAGP SQAHRFTTFN YGDVQLLDGP LKKQFDENHA 
FFLKLDEDRL LKVFRQKAGL PAPGEDMGGW YDLTGFDLAK GDFHGFVPGH TLGQYVSALA
RCYAATGSEE TKAKVHRLVK GYGATLDDKA SFFAGYRLPA YTYDKLSCGL IDAHEFAHDP
DAMAIHEKLT RGMLQYLPEK ALSRAEQRAR PHKDESFTWD ESYTLPENLF LAYRRTGNKF
YRELGTRFLE DDTYFNPLSE GINVLAGEHA YSHMNAFCSA MQAYLTLDSE RHRKAARNGF
RMVAEQSFAT GGWGPSEAFV EFNKGQLGDS LEKSHSSFET PCGAYAHFKL TRYLLQTDGD
STYGDSMERV MYNTVLGAKP IQPDGTSFYY SDYATVGKKV YHNDKWPCCS GTLPQVAADY
HISIYLKATD GVCVNLFVPS TLIWKASDGS CKLTQETKYP FETSVAMRFA TTQPVEQTLY
IRIPAWVTSE PALRVNGQRT DVAAKPGAFA AIRRTWKDGD RIDLDLPMGF ELQPVDGQHE
KLVALVHGPL VLFAIGDSRP RFHRSDLLDA KPSANNDWSV RAAGGKQVVF RSFLKIQDES
YSTYVEI