Gene Acid345_3457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3457 
Symbol 
ID4069033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4077205 
End bp4078137 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content61% 
IMG OID637985479 
ProductMarR family transcriptional regulator 
Protein accessionYP_592532 
Protein GI94970484 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1246] N-acetylglutamate synthase and related acetyltransferases
[COG1846] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.965318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0098073 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGAAG GCTTGCAGTC GCGGATTCGA AATGTCCGTA GTTTTAACCG GTTCTACACG 
CGGGAGATTG GCGTGCTCCG CGAGAGTTTC CTCGATGCGG GGTTCACGTT GCCCGAGCTG
CGGGTACTGT ACGAAGTGGC GCACGAGGAC GGGATCACGG CTTCGCAGAT TGCCGAGCGA
CTCGGGATGG ACCCGGGCTA TGTGAGCCGA CTGGTCCAGG AACTCAAGGC CCGGCAGTTG
TTGTCGCGAA CGAAGTCGAC GGAAGACTCG CGGCAAACCA TCCTGCGGCT GACGAAGCGC
GGCGAACAGC AGTTCACGGC GCAGAACAAG CGCCAGAATG AAGAAGTTGA GAAGATGCTG
GCGAAGATCC CGGAGTACGA TCAGCGGCAA CTTGTTTCAG CGATGAACAC TGTCCAGCGG
ATTCTCGGCG GCACGACAAC CTCCAAGAGT GCCGTCATTC TGCGAGAGCC GCGCCTTGGC
GATATGGGAT GGGTGTTGTG CGGACATGGC GAAGGCTATG CCGATGTCTA TGGCCTCGAC
GTGCGGTTCG AGGCGCTCGT CGCGCGAATC CTTGCGGACT TCATGGCATC GCGCGACCCG
GCGAAGGAAC GAGCGTGGAT CGCTGAGCGC GATGGCGAAC GCATGGGCTG CGTGTTCCTG
GTACGGCATC CGGAGCACAA AGACACGGCG AAGTTGCGGC TATTGTGGGT GGAGGCGGCA
GCGCGCGGCC TGGGCGTGGG CAAGGCGCTA GTACATGAGT GCACGCGATT CGCGAAACAA
GCCGGGTACA AGCGGATCGT GTTGTGGACG AACAGCGTAC TTGCGACTGC ACGAGCCATC
TATGAGCGCG AGGGATATAC ACTCGTGTCG GAGCAGGCAG AGCCGATCTT CGCGCAAGGG
CAGAGGGCGC AGGAGTGGGA GTTGGGACTC TAG
 
Protein sequence
MGEGLQSRIR NVRSFNRFYT REIGVLRESF LDAGFTLPEL RVLYEVAHED GITASQIAER 
LGMDPGYVSR LVQELKARQL LSRTKSTEDS RQTILRLTKR GEQQFTAQNK RQNEEVEKML
AKIPEYDQRQ LVSAMNTVQR ILGGTTTSKS AVILREPRLG DMGWVLCGHG EGYADVYGLD
VRFEALVARI LADFMASRDP AKERAWIAER DGERMGCVFL VRHPEHKDTA KLRLLWVEAA
ARGLGVGKAL VHECTRFAKQ AGYKRIVLWT NSVLATARAI YEREGYTLVS EQAEPIFAQG
QRAQEWELGL