Gene Acid345_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1012 
Symbol 
ID4069777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1275367 
End bp1276647 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content60% 
IMG OID637983019 
Producthypothetical protein 
Protein accessionYP_590089 
Protein GI94968041 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0134819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.109325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCGAT TGCTCCGCCT TTGCAGCGTG TACCTTATCC TGTTGGCGCT CTGTCCCCCG 
ACGAGCGCCT ACTCCGTCCT CACTCACGAA GAAGTCATCG ACCTCGCGTG GGACCACGAC
ATCGTCCCCC TCATCAAGGC CCGCTTCCCC GACGCCACCG ATGACGATCT GAAAGAAGCC
CACGCCTACG CCTATGGCGG CGCCGTCATC CAGGACCTCG GCTACTACCC CTTCGGCAAC
CGCACCTTTA GCGATCTCGT GCACTACGTA CGCTCCGGCG ACTTCGTCGT CAACCTCATT
AACGAGTCCG ACAGCATCAA CGAGTACGCC TTCGCCCTCG GCGCCCTCGC TCACTACGCC
TCCGACATCA CCGGACACCC CATCGTCAAC CAGGCCGTCG CCATCGAGTT CCCCAAGCTC
CGCGCCAAGT ACGGCAGAGT TGTGACTTAT GCCGAAGACC ACTCCGCTCA CCTCGAAGTC
GAGTTTGGCT TCGACGTCAG CCAGGTCGCC AAGGGCCGCT ACGCCCCGCA GTCCTACCAC
GACTTCATCG GTTTCGAAGT TTCGAAACCA TTGCTCGAAC GCGCTTTCAA AGACACCTAC
GGTATCGAGA TCACCAGCGT CATCACCCAT GAAGACCTCG CCCTCGGGAC CTACCGCCGC
TCCGTCAGCA AAATCATCCC CGAGATGACC AGGGTTGCCG TCGCCAGCCG CGAAGACGAA
ATGAAGAAAG AGATTCCCGA CTACAACCGT CAGAAGTTCC TTTACCGCCT CTCACGCGCC
CAGTACGAAA AGCAATGGGG CAAGCAATAC CAGAAGCCCG GCGCCGGAGC CCGCATCCTC
GCCGTAGTCC TGAAGATCTT CCCCCACATC GGCCCGTTCC GCGGACTCGC CTATAAGAAC
CCGACACCGC AGACGCAAGA CCTCTACTTC AAGAGCGTCA ACAGCACCTA CGACTACTAC
ACCCAGCTCA CGGAACAACT GCGAAACGGC GAAGTCACTC TCCACGCCAT GGATCTCGAC
ACCGGCAAGC CCACCCGCCC CGGCGAATAC TCGCTCTCCG ACTCAACTTA CGCCGAACTC
GTGAACCGTC TCACCAGTCA AGACAACGCC GAGATCACCC CCGACCTCCG CGAAGCTCTC
CTGACCTACT TCTCCGATCA CGATGGCAAG CCCGTCGCCC TCAAAGATCC CAAGAAGCAA
GCCCAACTAG AATCCGACCT GCAAAAACTA AAAGCCGCGC CCCCACAGAG CGCAGCCCAA
ACCACTTCCT CTTCACACTA A
 
Protein sequence
MPRLLRLCSV YLILLALCPP TSAYSVLTHE EVIDLAWDHD IVPLIKARFP DATDDDLKEA 
HAYAYGGAVI QDLGYYPFGN RTFSDLVHYV RSGDFVVNLI NESDSINEYA FALGALAHYA
SDITGHPIVN QAVAIEFPKL RAKYGRVVTY AEDHSAHLEV EFGFDVSQVA KGRYAPQSYH
DFIGFEVSKP LLERAFKDTY GIEITSVITH EDLALGTYRR SVSKIIPEMT RVAVASREDE
MKKEIPDYNR QKFLYRLSRA QYEKQWGKQY QKPGAGARIL AVVLKIFPHI GPFRGLAYKN
PTPQTQDLYF KSVNSTYDYY TQLTEQLRNG EVTLHAMDLD TGKPTRPGEY SLSDSTYAEL
VNRLTSQDNA EITPDLREAL LTYFSDHDGK PVALKDPKKQ AQLESDLQKL KAAPPQSAAQ
TTSSSH