Gene Acid345_3330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3330 
Symbol 
ID4070292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3948604 
End bp3949698 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content62% 
IMG OID637985352 
Producthypothetical protein 
Protein accessionYP_592405 
Protein GI94970357 
COG category[S] Function unknown 
COG ID[COG4320] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0308165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCC TGCAAGCCAG CCGCGAGTAC GAGCATTGGC TCGCGAAACA AACCGATCTC 
ATCGCGGCTG ACATTCGCCG CAAACACGCC TTCATGGCCC AGAGCGTGTT TCCATTTTTC
CGCGCCACGT TCTATCGCTG GCTGCAGCTC TGGCCCTCGC TCGATAAAGC CATCAGCGGC
GCGCCCAAAG TTCTCGCCGT CGGCGATCTC CACGTCGAAA ACTTCGGCAC CTGGCGCGAT
GGCGAAGGCC GCCTCGCGTG GGGCATTAAC GACTTCGACG AAGCATGGCT CTTCCCCTAC
ACTATGGACC TCGTCCGGCT CGCCACCAGC GCACTGTTGG CGAAGGATGC CGAGCATCTC
GCCGAACGCG GAAGGCTCGT GGCTGAAGCA ATTCTCGAGG GCTACCACGA CGCACTGGAA
CACGGCGGCA AACCGTTCGT TCTCGCCGAA GGCCAGGATT GGCTGCGCGC CATCGCCGCG
CAGCAATTGA AAGATCCCAA CGCCTATTGG GACAAGCTCA CATCGTGGCC GGAAGTGAGA
TTGAAAAACT TGCCGCAATT GGCCAAAGAT CGCATGACTG ACCTGCTGCC GCCCCATTGC
GACAAGCCAT TCTTCGTCGC GCGACAAGCC GGACTCGGAT CGCGCGGCCA CCAGCGCTAC
GTCGCCATCG CCCATTGGAA AGGTGGATGG GTCGCGCGCG AAGCCAAAGC CCTCGTTCCA
TCGGCGGCTG CATGGATCGC CGGCACCGGA CGCGATCGCA TCTACTACAA CGACATCCTC
GAGAACTCCG TCCGCGACCA CGATCCGTAC TTCAACGTGC ATGAACACTG GCTGGTACGC
CGGCTAGCCC CCGACTGCAC CAAGATCCCG ATCACCGATC TGCCCACGCG TCGCGACGAG
CACACGCTGC TCTACTGCAT GGGCTACGAA GTCGCCAACG TGCACCTCGG CACCAAGCGC
GCGAATGCTG CGATAGCGCA GGACCTGAAG AAGCGCAAAG CGCGCTGGCT CTACGATGCC
GCCCGCGACA TGCGCCGCTT GATTCGCCGC GACTTCGCCG AGTGGCGTAC CTCCCGTACT
CGCCCCGCGA AATAG
 
Protein sequence
MNILQASREY EHWLAKQTDL IAADIRRKHA FMAQSVFPFF RATFYRWLQL WPSLDKAISG 
APKVLAVGDL HVENFGTWRD GEGRLAWGIN DFDEAWLFPY TMDLVRLATS ALLAKDAEHL
AERGRLVAEA ILEGYHDALE HGGKPFVLAE GQDWLRAIAA QQLKDPNAYW DKLTSWPEVR
LKNLPQLAKD RMTDLLPPHC DKPFFVARQA GLGSRGHQRY VAIAHWKGGW VAREAKALVP
SAAAWIAGTG RDRIYYNDIL ENSVRDHDPY FNVHEHWLVR RLAPDCTKIP ITDLPTRRDE
HTLLYCMGYE VANVHLGTKR ANAAIAQDLK KRKARWLYDA ARDMRRLIRR DFAEWRTSRT
RPAK