Gene Acid345_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2136 
Symbol 
ID4072378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2552116 
End bp2555490 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content58% 
IMG OID637984151 
ProductTonB-dependent receptor 
Protein accessionYP_591211 
Protein GI94969163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0917219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATT GGCGGCTTCG CCTGGGCGCG TTGTTACTTG GTTGCTTGTG TGCGGGAAGT 
CTGTTCGCCC AGGAGATTAC GGGCGACATC CGCGGAATTG TGAAAGATGC TTCGGGCGCA
CTCGTCGCCG GAGCCACCGT TGAGGTAACC AACACTGATC GCAACACGAC AATTCGTACC
GTCACGACCG ATACCAACGG CAATTACGTT GCCGCTTACC TACCCGTCGG TCATTACAAG
GTCTCGGTCA AGAAGGAAGG CTTCAAGGCA GCTGAGACCA ACAATGTGGT CTTGAACGTG
CATGATCGCC TTACGGTGGA TGAAACGCTC CAGGTCGGCT CTAGCGGCCA GACCGTAACG
GTCAACGAGA ATCCCAGCCA GGTGAATCTC GACAACGCGA CTGCCCAAGG CGTGATCACC
GGCAACCAGG TGCGGCAGCT CACACTCGTC ACGCGTAATT ACGAGCAGTT GGTTGCAGCC
CTTCCCGGCG TTTCGACGAA CCTCGCTTCC GATCAGCTGT TCGTCGGCGT AAGCAATCCG
GTCGGCACCT CAAACCAGAT CAACTTTTCG ATTAATGGCA CCCGTCCAAC GCAGAACAAC
TGGCAGATCG ACGGTTCCGA CAACGTGGAC CGCGGCGCCA ACCTGACGCT GCTCGCCTAT
CCGAGCGTGG ATTCGATCCA GGAGTTCAAC GTCCTGCGCT CGAACTACAT GCCGGAACAA
GGACGCAGCT CAGGCGGACA GGTCAACGTC ATCACGCGTT CCGGCACCAG CGCCTTCCAC
GGCAGCGCGT ACGAGTTCTT CCGAAACGAT GTGCTGAACG CCAACAACTT CTTCAATAAT
CGTGCCGACG TTGAACGCCC CGCGATGCGT TGGAACGACT TTGGCTTCAC CATCGGCGGA
CCGATCTACA TTCCCGGCCA CTACAACACG GAAAAGAACA AGACGTTCTT CTTCTATTCG
CAAGAGTGGC GAAAGATCAT CACCTACAAC ACGTTCACCA GCGGCGTGCT GCCCACGTCG
GCAAACCTCG GAGGCGATTT CGGAAGCACG ATTTGCGTCG CTTTGAATCC CGATGGGACG
TGCGCAGCGT TGGGCAATCA TGTCTCCACG ATTAGTCCCA CGGCGCAGGC ATACATCAAC
GACATCTATT CAAAGTTCCC AGCGCCCAAC AATGCTGACG GAACGCTCAC CTGGGTAGGA
CGCAACCAGT TCAACTATCG CGAAGAGAAC GTTCGCGTTG ACCACAATTT CTCGTCCAAG
TTCAGCATCT TCGGACGCTA CCTCGACGAC CAGATCCCAA CGCAGGAGCC TGGCGGTCTG
TTTACCGGTC TCGCCGTTCC TGGCGTTGCT GTGACCAACA CGAATGCTCC CGGACGCAAC
GCCTCGATTC ACGCGACGAT TGCGTTCTCG CCCACGACGC TGGCGGACAT GGGCTATGCG
TACTCGTATG GCGCGGTCAT CAGTTCGCCG GCGGGAACCA TGGCCTCAGC GAATTCGCCG
GATGTGAATC CGACGCTTCC CTTCGGACTC GGTCCGCTGC TTCCGGGCAT CGGATTCTTC
AATTCCACGC AGGGGCTCGC CGGATTCGGT CCATACAACG ACTACAACTA CAACCACAAC
GCGTTCGCTA CGTTGACGAA GGTGATTGGA AAGCACTCGC TGAAATTCGG CGGGACCTTC
AACTACTACA CCAAGGACGA GAACGTGAAT GGCTACGGGC TGCAATCGGG CTCCTACACG
TTCGCGGATT GCGTGGATAG CAGTGCTACC GTCACCAGCC CGTATCCCTG CTCCGACACC
GGCAGCACCG ATCAGGAGTG GGCGAACTTC CTCAACGGCA ACGTGTCGTC GTTCAACCAG
ACAAACATTG ACTTCCGCGC GCTGGTACAT CAGCACCAGT GGGAATTCTT TGGTCAGGAT
GAGTGGCGTC TCACACCGTA CTTCACACTC AGCTACGGCG TGCGCTACTC GCTCTTCCAG
GCGCCCACCT ACGGCAACGG CCTGCTTACG ACCTTCGATC CGTCGAAGTT CGATTCCACC
AACACGCCCG CGATCGACAG CAACGGTCTT TACGCGGCCG TGCCATCTGC GCCGTATACC
AATGGCATCC TGATCGGCGG CAAGGATTCT CCGTATGGCG ATGCCGTGAA CCGCACGCCG
AAACTCAACT TCGCGCCGCG CTTAGGGTTT GCATGGGACC CGACGCATAC CGGCACGACT
TCTATCCGCG GCGGATTCGG ACTGTTCTTC GATTCACCTG CCGTGAACAG CATGGAACAG
TTCCAGCCGG GAAATCCGCC GTTCGTTACT TCGACCTCAA TTTCGAACAC CAATTTCGAC
AATCCGGGTT CAGTACAGGC GGCGCCAAAC CTGTCACCTC CCGACATTGG CGGCATCGCT
CCTAACTGGA AGCAGCCGTA CACGATGATG TGGAGCCTGG ACGTGCAGCA CCAGTTCACG
CCGTCCACCA TCTTCGACAT TGGCTACTAC GGCAACGCGG GACGCCATCT TATTGGCGTT
GTAGACGTAA ACCAAGCGCC ACTCGGCGGC TTCCAAGCCC TCGGCATTCC GGGGCCGGTC
AGTTCCGGTG ACACGCAGAA GCTCAATCAG ATCCGTCCGT ACCAGGGCTA TGCGTCAATC
GACTTGTTCT CGCCGGTATT TACGTCGAGC TACAACGGCC TGCAGACGTC GTTCACCAAG
CACTTCACCG AGAATTCGAT GATCGTGCTG AACTACACCT GGTCGCACGC TCTGGGCACA
GCTTCGAGCG ACTACCGTGC GCCGCAGTAT TCCATGGATA TTGGCGCGGA ATACGGCAAC
CTCGACTACG ACCGTCGCAA CATGTTCACC GCCAACTATG TGTACGACCT GCCGTTCTTC
AAGCACCAGC AGGGGGTTGC GGGACACGTG CTCGGCGGTT GGGAAGTCTC CGGATTGTTC
TATGCGTATA GCGGGGCGCA CTACACCGCG AGCGCATCGC GCGATCCCGG CGGCCTCGGC
TTGCGTGATC CGAACACCTT CGAGGGCGGG CGCCCCGACC TCATTGGCAA CCCTCAGCAG
GGCGCACCGA ACCATCTCGA CAAGTGGTTC AATACCTCAG CGTTTGCGCT CGTGCCAGCC
GGCGACGTCC GTGTGGGTAA CGAGCCGCGC GGCGTCATCG TGGGGCCGGG CTACTTCCGT
TGGGATGCTT CGCTGTTCAA GAACATCAAG TTCACCGAAC GCTTGAACTT GCAGTTCCGT
GCGGAAGCTT TCAACGTGCT CAACCACACG AACTTCAACG CTCCCAACGT CAGTGCGACG
AGCTCGCTCT TCGGACAGAT ACTGTCCGCA CGCGATCCTC GGCAGCTACA GCTTGCCCTG
AAGTTGACCT TCTAA
 
Protein sequence
MSHWRLRLGA LLLGCLCAGS LFAQEITGDI RGIVKDASGA LVAGATVEVT NTDRNTTIRT 
VTTDTNGNYV AAYLPVGHYK VSVKKEGFKA AETNNVVLNV HDRLTVDETL QVGSSGQTVT
VNENPSQVNL DNATAQGVIT GNQVRQLTLV TRNYEQLVAA LPGVSTNLAS DQLFVGVSNP
VGTSNQINFS INGTRPTQNN WQIDGSDNVD RGANLTLLAY PSVDSIQEFN VLRSNYMPEQ
GRSSGGQVNV ITRSGTSAFH GSAYEFFRND VLNANNFFNN RADVERPAMR WNDFGFTIGG
PIYIPGHYNT EKNKTFFFYS QEWRKIITYN TFTSGVLPTS ANLGGDFGST ICVALNPDGT
CAALGNHVST ISPTAQAYIN DIYSKFPAPN NADGTLTWVG RNQFNYREEN VRVDHNFSSK
FSIFGRYLDD QIPTQEPGGL FTGLAVPGVA VTNTNAPGRN ASIHATIAFS PTTLADMGYA
YSYGAVISSP AGTMASANSP DVNPTLPFGL GPLLPGIGFF NSTQGLAGFG PYNDYNYNHN
AFATLTKVIG KHSLKFGGTF NYYTKDENVN GYGLQSGSYT FADCVDSSAT VTSPYPCSDT
GSTDQEWANF LNGNVSSFNQ TNIDFRALVH QHQWEFFGQD EWRLTPYFTL SYGVRYSLFQ
APTYGNGLLT TFDPSKFDST NTPAIDSNGL YAAVPSAPYT NGILIGGKDS PYGDAVNRTP
KLNFAPRLGF AWDPTHTGTT SIRGGFGLFF DSPAVNSMEQ FQPGNPPFVT STSISNTNFD
NPGSVQAAPN LSPPDIGGIA PNWKQPYTMM WSLDVQHQFT PSTIFDIGYY GNAGRHLIGV
VDVNQAPLGG FQALGIPGPV SSGDTQKLNQ IRPYQGYASI DLFSPVFTSS YNGLQTSFTK
HFTENSMIVL NYTWSHALGT ASSDYRAPQY SMDIGAEYGN LDYDRRNMFT ANYVYDLPFF
KHQQGVAGHV LGGWEVSGLF YAYSGAHYTA SASRDPGGLG LRDPNTFEGG RPDLIGNPQQ
GAPNHLDKWF NTSAFALVPA GDVRVGNEPR GVIVGPGYFR WDASLFKNIK FTERLNLQFR
AEAFNVLNHT NFNAPNVSAT SSLFGQILSA RDPRQLQLAL KLTF