Gene Acid345_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1406 
Symbol 
ID4068747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1704034 
End bp1705413 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content59% 
IMG OID637983415 
Producthypothetical protein 
Protein accessionYP_590482 
Protein GI94968434 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0324522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCCC TTATTTTTGC GGCCAGCGTC AGCGCAGTGG TTGCTTTCGC AATCACCTAC 
TTCGCTCTTC GCTCGCAGTC CGCCGCGGCC GTGGCACGGC TTTCGGTCGC CGAGCGCGAA
CTGGTAACCG CACGCATGGA AGCAGCGAAT AAAAGCTCGC AGGCTTCGCA ACTCGACCGC
GAACTCGCTG GGCTGCGCGC TACTTTGGAC GCTGAAAAGA AGGCTGCTGA CGAAAAACTT
GCCGCGGTCC TCCAGGCACG CGACGACATG AAGGCACAGT TCGAGGCGCT TGCCGCATCC
ACGCTTCAAG CCAATAGCCG GTCATTCCTT GATTTGGCGC AGACCAAGCT TACCGATTTC
CAGAACCAGG CCAAGGGCGA TCTCAACGAA CGGCAGCAAG CCATTGAGAA CCTGGTTAAG
CCGATCCACG AGAGCCTGAC GAAATTCGAC GGACAGATCC AGCAGATTGA AAAATCGCGC
AACGAGGCAT ATGGATCGCT GCTGAACCAA CTGCAGACGC TGACGCAGTC GAATGACCAG
CTGCGGGTGC AGACCGGAGC GCTGGTTACC GCGCTGCGCG CGCCACAGGG GCGCGGACGT
TGGGGGGAAA TCCAGTTGCG TCGTGTTGCC GAGATGGCGG GGATGATCAA TCGCTGCGAC
TTCCTCGAGC AGGAACACCT CTCTACGGAA GACGGCGGTC TGCGTCCGGA TATGGTCGTG
AAGCTGCCCG GCGGCAAGAC CGTGGTCGTT GACGCGAAGA CTCCGCTTGC GGCTTATTTG
AGCGCGCTCG AAGCGACGAG CGATACCGAG CGCGCTGAGT TTCTGCGGCA GCACGCGGCG
CAGGTTCGGG TGCACATCAA GAAATTGGGA GCGAAGAGCT ACTGGGAGCA GTTTGAAAAT
GCTCCTGAGA TGGTCGTGAT GTTCCTGCCG AACGAAGCGT TCTTCAGCGC TGCGCTTGCC
GAAGATCCGA CGCTGATCGA AGCGGGCGTT GCCGACAAGG TCATCATCGC GTCGCCGACT
ACGCTGATCG CGCTGCTCCG CGCTGTCCAT TACGGATGGC AGCAGCAGGA GATCGCGCGC
AATGCGGTCG AGGTCAGCCA ACTTGGCAAA GATCTCTACG AACGGTTGTG CACGATGGTG
GGCTACTTCG AAGACGTGGG CGGCAAACTG GATGGCGCGG TGAAGGCATA CAACAAAGCT
GTGTCGTCGC TGGAATCGCG TGTACTTTCG AAGGCGCGCA AGTTTCCCGA CCTCGCCATA
CAAATCAAAG AAGAGATCCC GCAGATCGAA CAGATTGAGC CCACCACGAA GAAACTTCAG
GCAGGCGATT GGACCCAGGA AGCCGAACAA CTTCCGCTTG TGGATAAGGC CGGCGTCTGA
 
Protein sequence
MGSLIFAASV SAVVAFAITY FALRSQSAAA VARLSVAERE LVTARMEAAN KSSQASQLDR 
ELAGLRATLD AEKKAADEKL AAVLQARDDM KAQFEALAAS TLQANSRSFL DLAQTKLTDF
QNQAKGDLNE RQQAIENLVK PIHESLTKFD GQIQQIEKSR NEAYGSLLNQ LQTLTQSNDQ
LRVQTGALVT ALRAPQGRGR WGEIQLRRVA EMAGMINRCD FLEQEHLSTE DGGLRPDMVV
KLPGGKTVVV DAKTPLAAYL SALEATSDTE RAEFLRQHAA QVRVHIKKLG AKSYWEQFEN
APEMVVMFLP NEAFFSAALA EDPTLIEAGV ADKVIIASPT TLIALLRAVH YGWQQQEIAR
NAVEVSQLGK DLYERLCTMV GYFEDVGGKL DGAVKAYNKA VSSLESRVLS KARKFPDLAI
QIKEEIPQIE QIEPTTKKLQ AGDWTQEAEQ LPLVDKAGV