Gene Acid345_0329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0329 
Symbol 
ID4070091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp357323 
End bp358366 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content59% 
IMG OID637982332 
ProductLacI family transcription regulator 
Protein accessionYP_589408 
Protein GI94967360 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.16025 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.855853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTC GGGAGATTGC GAAGAGGGCA AAAGTTTCGA CCGCCACGGT CTCGCGGACG 
ATTAACCGCG TCCCAACCGT GGATCCGAAA CTGGCCAAGC GCGTCTGGCG GGTCGTTGAT
GAGCTCGGCT ATTTCCCGAA CACCCAGGCG CGCGCCCTGG TGTCAGGGCG GAGCCGAATC
CTCGGCTTGG TGGTCTCGGA AATCACCAAT CCGTTCTTCC CGGAAATCGT GCAGGTCTTC
GAAAACATCG CTGTCCAGAA CAACTACGAG ATCTTGCTCA CCTCTACGGG GCACGATCCC
GTGCGGATGG AAATCGCAGT CCGGCGGATG ATTGAGCACC GCGTGGAAGG TGTGGCACTG
ATGACCTTCG GGATGGAAGA GTCGCTTCTG GAAAACCTGA AGCGGCGGAA AATTCCGATG
GTGATTGTGG ACGTGGGGCC GCCGCGTCCG CTGGTGAGCA ATATCCGCGT GGATTACCAG
CATGGCATAC GGCAGGCTGT CCAGCACCTC GCCGCTCTCC GACATCACAG GATCGCGTTT
ATCTCAGGAC CGCTGCGGCT GCCATCGGCG CGGGCGAGGC TTGATGCGTT TAAGAACGCC
ATGCACGAAC TGGACTTGCC GGCACATGAT GAGTTGTGGG TGGAAGGTAC GCATACCATC
GAGGGCGGAG TCGAAGCTGC AGGGCGCCTG CTCTCGCTCC CCTCGCGGCC GACGGCAATT
ATGTGCTCGA ACGACATGAC GGCGCTGGGA GTCATGCGCA AGAGCCACGA ACTCGGCATC
CACATTCCGC ACGACCTCTC GCTCATCGGC TTCGACAACA TTCACATTTC CGAGTTCGTG
CTGCCTCCGC TGACGACGAT AGAGATGTCT CAGGCGGAGC TGGCAACGCT GGCATTTAAT
GCGTTACTCG CCGAGCTGCA ACGCAAAACG CCGAACCCGA ATGGAACGGA ATACGCGCTG
GAGACACACC TGATCTTGCG CGAGTCCACC GCACGTCCAA AGCAGGAAGC GGATAACGCA
AAGAAGAAAA AGGCCGCGCG GTAA
 
Protein sequence
MDIREIAKRA KVSTATVSRT INRVPTVDPK LAKRVWRVVD ELGYFPNTQA RALVSGRSRI 
LGLVVSEITN PFFPEIVQVF ENIAVQNNYE ILLTSTGHDP VRMEIAVRRM IEHRVEGVAL
MTFGMEESLL ENLKRRKIPM VIVDVGPPRP LVSNIRVDYQ HGIRQAVQHL AALRHHRIAF
ISGPLRLPSA RARLDAFKNA MHELDLPAHD ELWVEGTHTI EGGVEAAGRL LSLPSRPTAI
MCSNDMTALG VMRKSHELGI HIPHDLSLIG FDNIHISEFV LPPLTTIEMS QAELATLAFN
ALLAELQRKT PNPNGTEYAL ETHLILREST ARPKQEADNA KKKKAAR