Gene Acid345_3397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3397 
Symbol 
ID4072733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4021732 
End bp4022973 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content53% 
IMG OID637985419 
Producthypothetical protein 
Protein accessionYP_592472 
Protein GI94970424 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.117758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCCGGG GCCCCCTTCA AGACCTGCTG CGACGCACAA AACGGTTCAA ACGAGGAAGG 
ATTCGCAACC GGCGTGTCTC TCTTCAACGA CGCTTACGGC GGCGGAAGAA TCTGGTGCCT
CAGTTACGCG TTGTTATTCC TCCGAGCTTC TCGATCCTCG ATGATCCCGA ATCGAACCTC
GCTTTCATAT CGCGATTCCG AAGCCAGCTT TCCGCCAAGA TGTACAGGTC TGTTCACTTC
GATCACTCAG GGTGCAAGAA CTTGGGAATG GATGCGCAGG CGATTGTGGA TGTCCTCGTT
GCCGAGGAAT TGGCTAGGCG ACCGAACGGA ATTGGAATAG GTGGAGACTT TCCCCGCGAC
GCTCGGACGA ACGTAATGCT GCGAGCGATC GGAACGTTAC GCCAGTTCGG ACATCCCGAA
ATGAAGCTGG CTCCAGAAGT CGAGAGTCGC ATCGAACGCT GTGACCGCGT CAATGGCAAC
GGACACAACC TCAAATACAG TTCTGAGCGG GACCGTGCTG CACTAGCATT GGTCACGTAT
GTTGAGCGCG TGCTCCAACA TCAGTCCTTC ATGCTCACCT TCGAAGGTCG TTCCGACTTG
AGCAGCATAA TCACGGAGGT AATCGGCAAT GCCGAAGAAC ACAGCGGGCG CTGGTATGCC
GTCGCGTTCT CGCAGCCGGG CATCGTTGAA CAACAAGGCA TGCCTGAACC CGAGGAATGT
CAGATGGTCC TCTTTAATTT TGGCCGCTCC ATTTATGAGT CCTTGGTTTC GAGAGGAGCA
TCTACCTATG TGAAGGAGAG GATCTCAGCC TTGGCAAATG AGCATCGTCA TTCCAGGCAG
TTTTCCGACA GTTGGACAGA GGAAGATCTC TGGACTCTGG CAGCTCTCCA GCAAGGTGTC
AGTCGATATC GAACCGACGA AAAGGGGAAG ACGCGCGGAA ATGGGACGAT TGAACTCATA
CGCGCCTTTT CCGAGCTATC CGATGTACCC AAAAAAATGT GCGTCGTTTC AGGGCACACG
TATATACTCT TCGACGGAAG CTACAAATTG CGCGCCGACT CCAACGGTTT GCAAATGATT
GCCTTCAATA CATCGAACGA TTTGGAAAAA CCTCCAGACC CACGGTACGT TCGTCACTTG
AAACACGGGT TCCCAGGCAC GATCATCAGT ATGCGATTTG TGATGGATTC CAGATACCTG
GAATCGCGGA TTCAAAGCAA TGGCTCATCA GAACGTAATT GA
 
Protein sequence
MRRGPLQDLL RRTKRFKRGR IRNRRVSLQR RLRRRKNLVP QLRVVIPPSF SILDDPESNL 
AFISRFRSQL SAKMYRSVHF DHSGCKNLGM DAQAIVDVLV AEELARRPNG IGIGGDFPRD
ARTNVMLRAI GTLRQFGHPE MKLAPEVESR IERCDRVNGN GHNLKYSSER DRAALALVTY
VERVLQHQSF MLTFEGRSDL SSIITEVIGN AEEHSGRWYA VAFSQPGIVE QQGMPEPEEC
QMVLFNFGRS IYESLVSRGA STYVKERISA LANEHRHSRQ FSDSWTEEDL WTLAALQQGV
SRYRTDEKGK TRGNGTIELI RAFSELSDVP KKMCVVSGHT YILFDGSYKL RADSNGLQMI
AFNTSNDLEK PPDPRYVRHL KHGFPGTIIS MRFVMDSRYL ESRIQSNGSS ERN