Gene Acid345_3599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3599 
Symbol 
ID4072821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4255581 
End bp4257665 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content60% 
IMG OID637985622 
Producthypothetical protein 
Protein accessionYP_592674 
Protein GI94970626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.161985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00880556 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTCGTCC CGACGCGTCA AAACTCCCTG CTGCCGTGGA AGTGCCTGCT ATTTGCCGGT 
TTTGCGGCAT GGCTTGCTTC CATGAGTGCA TTTGCACAAA CCAAACCTTC TGCGGTCACA
ATCACGAAGC CGGCAACGTC CGCGACTATG TTGTCCGAGT TGCCCGCCAG CGAGAAGGCG
ATTCTTGCCC AAGCTCCCGA AAACTTTTAC GAGTTTGGCG CGGCCAAGTT GGGCGAAGTC
ACCGGTCCGC AGCGGTTCAC CGTGCAGTTC GCGCAAACTG CAACCATCAC GAAGATCACT
TCGTCCGCCG ATTTCAAGGT GGAAAGCGAC AGCACCTGCG GGGAAGGACG AACTTATAGC
GCAGGCACCG ATTGCATCCT GATGGTGCGG TTTGTGCCGC AGGGCCCCGG CCACCGAGCA
GGGAAATTGC AGGTGAGCCA TAGTGCAAGC GTGCTGCCGA TGGGCTTTGG AATTGGCGGT
TACGGCTACG CGCCGGTGAT CAGCTTCGTA CCGTCGATCA TCACGACGGT TCCGGCGACA
GTTTCGGCGG GCGTCGGCCT GCTGAACGGC GCCCAGAATC TTGCACTGGC CGGCGATACG
CTGCTGATTG CGGACACTGG CAATGGGAAG GTCCTCTCGA TCGATTCGAC TGGAACGTCG
AAAGCTCTCG CCACCGGTTA TACCGGCGTG TACGGAGTCG CACAGGATAC CTTCGGGCAG
ATTTACTACG ATGTGCCGTC CACCGGCAAG ATGTACGAGA TCTACGACTA CGGCCCGGTC
GTTCAAGTCA GCGGGACGGG CACCATCGCT TGTCCCGCTT CGACTCCATG CACTCTCAGT
TCCGAGGCGC TGGGGACTCC GGGCGCAATG TCCATCGATC CGTACAATCA CCTGTTTTTC
GTTGACAGCC ATCAAGGCGC GGCTTTTTCC ACGGTTCAGC CCATACCCGC GAACCTGATC
TTCCTCTACG ATCCATTTCC GTACCAGCAA AGCCCGTCGG CGACGATGGC CGTGGATTCC
AGCGACAATC TTTACTCGCT CTGGGCAAAC GGTAGCGTCT GCGAAATCGC GCAGCAGTCG
TTGTACAACG CGGAGAACAA TTTCGTATCG TTCATCAAGG TCGCCGGTGG ACACACGTGC
GGCTTCGCAG GTGATAATGG CCAAGCCACC AATGCTGAGA TCGGCAACAA GATCGGGCAG
ATTGCTTTCG ATGCTGCCGG GAACCTGTAC TTCACCGATA AGGCGAACAA TCGCGTGCGC
CGCATTGACT ACACCAGCGG CATCATCCGC ACGATTGCCG GCAATGGCAT TGCGGGGTAT
ATCGGCGATA CCGCCGGAGC GACAACCGCT GAGTTGGCCA ACCCGACTGG CGTAGGCGTT
GATTCGGAGG GGCAGGTGTT CATCATTGCG GGGGACGCGG CCAGCAGCAC AACGCAAGTT
GTGCGTCGAG TGACCACGTT CGGCAAACTG GTGCTGCCGA CGACGAAGAA CGGCGTGAAG
AGCGCTGCTT CGACGGTCCT GGTTTCGAAT ACCGGCAACT CTGCGATGCA ACTCACAAAC
GCCAAGTTCA CGGGGACGAA TCCCAACGAC TTCGCGGTCG ATCCCAACAC CACGACCTGC
TTGCTGACAC CGGGCAGCGT GCTCGCCGCG GGGCAGAGTT GCCAGGTCGG CTTCATCTTC
CAGCCGACCG GAACAGGTTC ACGCTCGGCG TACTTCAACC TGCTCAGCAA CACGTTACTG
GGCATCAACA ACGTGACGCT CAGCGGCGCG GCGACCACGG CCCCGGCGAT CCGGTTCACC
GCGCCGACGA GCGCCAGCTT GCTTCTGGCC AATGCAAAAG TCGGCATGAA AGTCAGTGTG
ACGTCGGATG CTTCCACGGC TCCGACCGGC ACGGTGACTT TCAAGGTTGA CGGTAAGCAG
ATCGGAGGGA CGGTTGCACT GGCCGGCGGA AGCGCTTCTG CGTTGCAGAC CAGTTTGTCT
GCGGGATCGC ACACGCTCTC GGTGTCGTAC AGTGGCGACC GGTACACAGC CTCGGGTACG
GTGAGTGAGA CGGTCACGGT AAAAACCGCG GCGACGTCCA ACTAA
 
Protein sequence
MFVPTRQNSL LPWKCLLFAG FAAWLASMSA FAQTKPSAVT ITKPATSATM LSELPASEKA 
ILAQAPENFY EFGAAKLGEV TGPQRFTVQF AQTATITKIT SSADFKVESD STCGEGRTYS
AGTDCILMVR FVPQGPGHRA GKLQVSHSAS VLPMGFGIGG YGYAPVISFV PSIITTVPAT
VSAGVGLLNG AQNLALAGDT LLIADTGNGK VLSIDSTGTS KALATGYTGV YGVAQDTFGQ
IYYDVPSTGK MYEIYDYGPV VQVSGTGTIA CPASTPCTLS SEALGTPGAM SIDPYNHLFF
VDSHQGAAFS TVQPIPANLI FLYDPFPYQQ SPSATMAVDS SDNLYSLWAN GSVCEIAQQS
LYNAENNFVS FIKVAGGHTC GFAGDNGQAT NAEIGNKIGQ IAFDAAGNLY FTDKANNRVR
RIDYTSGIIR TIAGNGIAGY IGDTAGATTA ELANPTGVGV DSEGQVFIIA GDAASSTTQV
VRRVTTFGKL VLPTTKNGVK SAASTVLVSN TGNSAMQLTN AKFTGTNPND FAVDPNTTTC
LLTPGSVLAA GQSCQVGFIF QPTGTGSRSA YFNLLSNTLL GINNVTLSGA ATTAPAIRFT
APTSASLLLA NAKVGMKVSV TSDASTAPTG TVTFKVDGKQ IGGTVALAGG SASALQTSLS
AGSHTLSVSY SGDRYTASGT VSETVTVKTA ATSN