Gene Acid345_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1106 
Symbol 
ID4069566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1380961 
End bp1382238 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content58% 
IMG OID637983115 
Producthypothetical protein 
Protein accessionYP_590183 
Protein GI94968135 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000160937 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATCGAT TTGTACCTGT TTTCGGGAGC TCTGCGTATC GGGTAAACTG CCCTAGAACC 
CCAGCGATGA CACCTTACCG CTACGCCCGT CCGCTTATTA CCTTGGGCAT CTTATTGATC
TCGCTGTGGG TTAGCTCTCC CGGCCAAATT GTGCCAACCC GCATCAATAC TGACCCGGCT
AGCGCCAAAG GGGCACAGTT CTTCTACAAC CTCGAATATG ACAAATCCAT CCGGGAATTC
GAGGCCGTAT TAAAGGCACA TCCCGACGAT CCTTTCGCGG TCAACCACCT CTTGACGACG
GTGATGTTCA AAGAACTGTA TCGAATCGGC GCCTTAGATA CGGAGCTCTA TTCCGGCGAT
TCGTTCCTGA CCAAGAAGCA GTTTGCGCCG ACCGATCCGC AGGTGGTAGC ACGCGTGAAA
GAGCTGATGG ATCGTGCGTT CTTTCTCGAA GAAGAGCACC TCAAGACGAA TGCCAATGAC
GTCGACGCGC TTTACGCGCG CGGGGTGACC CGGGGCCTAC GCTCCACGTG GACCGCGTTG
GCTGAGAAAG CGTGGTTTGC GGCACTGCGG AGTGCCGTCG GCGCACGGCA CGATCATGAG
CGCGTGCTCG AACTCGATCC CAAGTATGTA GACGCAAAGA CGATCGTCGG CGTGCACATG
TACGTAACCG GCAGTCTGCC ATGGGCGGTG AAAGCGGCGG CTTCCGTCGC GGGATTGTCC
GGCAACAAGC AAAAGGGGCT TGAATACCTT CGCGCCGCCG CCGCGCATGC GCCTGAGAGC
GGCATGGATG CTCGCATCAC GCTAGCGCTT TTTCTCCGCC GCGAGCAAAA GTACGATGAG
TGCCTTCAGG TCGTAAAAGG CATGCACGAT GAGTATCCCC ACAATTTCCT GATCTCTGCC
GAATACGCGC ACCTGTTGAA TGCAGCGGGA CACGGGCCGG AGGCGGTTGC GGAATATCAG
TTGGTCCTCG ACCGCTATCG CAAGGGATGG TTCCCGGTGA GTCGTCCGGA ACAAGCGGCT
TTTGGGCTGG GCGAAGCAGC GCGCGGCCAG AAGCAGTACG AGCTTGCGCT GAACGGGTAC
AACATGGTCA GCACGTTTAA GAACGTGGAT CAGGAACTCA TTCAGCGTAC GAATCTCGGC
GCCGGCGAGG TGCTGGACCT AATGAACCGG CGGGATGATG CCGTAAAGCG CTACCAACAG
GTGCTCGCGT TCAATCAATC GAATGCCGTA GCCGAGCGCG CCAAGCACAA TCTCAAGACC
CCGTATCGCG GGATGTAA
 
Protein sequence
MNRFVPVFGS SAYRVNCPRT PAMTPYRYAR PLITLGILLI SLWVSSPGQI VPTRINTDPA 
SAKGAQFFYN LEYDKSIREF EAVLKAHPDD PFAVNHLLTT VMFKELYRIG ALDTELYSGD
SFLTKKQFAP TDPQVVARVK ELMDRAFFLE EEHLKTNAND VDALYARGVT RGLRSTWTAL
AEKAWFAALR SAVGARHDHE RVLELDPKYV DAKTIVGVHM YVTGSLPWAV KAAASVAGLS
GNKQKGLEYL RAAAAHAPES GMDARITLAL FLRREQKYDE CLQVVKGMHD EYPHNFLISA
EYAHLLNAAG HGPEAVAEYQ LVLDRYRKGW FPVSRPEQAA FGLGEAARGQ KQYELALNGY
NMVSTFKNVD QELIQRTNLG AGEVLDLMNR RDDAVKRYQQ VLAFNQSNAV AERAKHNLKT
PYRGM