Gene Acid345_4433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4433 
Symbol 
ID4070915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5264948 
End bp5266114 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content58% 
IMG OID637986471 
Producthypothetical protein 
Protein accessionYP_593507 
Protein GI94971459 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0480068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGGA GTGTCACCGT GAAATTCGCA TGGGCCGCAA TTCTTCTTCT GATGTTCTCG 
ATCGTGGCTG CCGCACAGGA AACTTCGGAA GTCGCGACTC CTCCCAATGG CGACAATCAG
CACGCATCCG TCTCGCAATG GATTGGCCCA GTGAAAATCT CCATCGACTA CCACAGTCCC
AGGGTCCACA ACCCCGCGGA GAACGATCGC ACCGGCCACA TCTGGGGAGA ACTGGTGCAC
TACGGCTTCG TCGACGAGGG CTTTGGCCCG ACGCAGGCCG CGCCCTGGCG TGCCGGTGCG
AACGAGAGCA CCGCCATCAC CTTCTCTCAC GACGTGAAAG TCGAAGGCAA AGACCTGAAG
GCCGGGACCT ACGCGCTCTT TCTCGATGTG GAGAAAACTA GCCCCTGGCA GTGGATCTTC
TCTAACCACC AGGGCTGGGG AAGTTTTCAA TACGATCGCA AGGATGACGT TCTGCGCGTC
CCCGTCGCTG CGCAGGACGC ACCGTTCACC GAATTCCTCA CGTACGGCTT CGACGATCGC
CGGCCGGATT CCGCGGTGGC TTACCTGCAA TGGGAAAAGA AGCGGGTCCC CTTCAAAGTT
GAGGTTCCCA ATGTGAAGGC GCTCTATGTC GCGAAGATGC GTCAGGACCT GCAATCGTGG
GCGGGATTCA ACTACCAGGA CTGGCAGACC GCCGCGCAAT TCTGTGCAGA TAACAAGATC
AATCTCGAGG AAGCGCTGAC CTGGGCGGAC AAGGCGATCA ACGGCCCATT CCGTGGCGCG
ACCATTGGAC ACGAGGAGTT TGCCACTCTC TCCACCAAAG CAGCCGTGCT GAGCGCCATG
GGCCGTGAAG CGGATGCCGA CAGCGTGATG GACAAAGCCC TGCATCTGCG CGCAACCGAC
GCGTACTCCG TTTATGCCTA TGGCATGGGA CTACTGCGCA ACGACAAAAA AGACAAAGCG
ATGAAGGCAT TCACGTTCAA TCAGCAGCAA CATCCGGAAG ACAAGTTCTG GACCGCGCTG
GGACTCGCTC GCGGCTACTC CGCTAACGGC GACAAGAAGA ATGCAATCGC GAATTGGGAA
ATCGTGGTGA AGAACGTGCC CGCCAACCTG AGCAACCGAA CCGCCGGATA CGAGGCAGCG
CTGAAGAAAT TGAAAGAGGC GATCTGA
 
Protein sequence
MDRSVTVKFA WAAILLLMFS IVAAAQETSE VATPPNGDNQ HASVSQWIGP VKISIDYHSP 
RVHNPAENDR TGHIWGELVH YGFVDEGFGP TQAAPWRAGA NESTAITFSH DVKVEGKDLK
AGTYALFLDV EKTSPWQWIF SNHQGWGSFQ YDRKDDVLRV PVAAQDAPFT EFLTYGFDDR
RPDSAVAYLQ WEKKRVPFKV EVPNVKALYV AKMRQDLQSW AGFNYQDWQT AAQFCADNKI
NLEEALTWAD KAINGPFRGA TIGHEEFATL STKAAVLSAM GREADADSVM DKALHLRATD
AYSVYAYGMG LLRNDKKDKA MKAFTFNQQQ HPEDKFWTAL GLARGYSANG DKKNAIANWE
IVVKNVPANL SNRTAGYEAA LKKLKEAI