Gene Acid345_1991 details


Gene Information

Locus tag: Acid345_1991
Symbol:
ID: 4070897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2385727
End bp: 2387976
Gene Length: 2250 bp
Protein Length: 749 aa
Translation table: 11
GC content: 60%
IMG OID: 637984005
Product: TonB-dependent receptor
Protein accession: YP_591066
Protein GI: 94969018
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
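
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Both are assumptions (the record does not state them), and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2385727
END_BP = 2387976


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # prints: 2250 749
```

Under these assumptions the computed values agree with the listed Gene Length (2250 bp) and Protein Length (749 aa).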


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAAGT ACCGCATGAT TGCGGCGCTG ATATTCGTAT TGGCAGCAAT TCAGAGTTTT 
GCTTCGGTCG TGGGATCGGT GCGCGGAATC GTGCACGATC CGCAGCATCG TCCGCTGGAC
AGCGCGACAA TCACGCTGAC GGCACGCGGA TCGGACTACC GACAGACCAC AACGACGAAC
AGCGCCGGCG AATTCGCTTT CGCGGCCGTG CCTGCCGGCG AATATGAGAT TACCGTTGTC
GCAACAAATT TTCGCGCCGC TGTACAGAGC GTGCAGGTGA CGGCGAGCGC CGCGCCGGTG
CTGCACTTCC AACTCGCGGT GGCAACGCAA AGCCAAACCG TTGAGGTCAC CGACAGCGGG
GAAGGGCTCG ACGTGCAGCC GGCAACGGGA ATGGTGAGCA GTCGCGATAT CGCGCGCACC
CCCGGCGCAA CACGCACGAA CAGTCTTGCG ATGATCACCA ACTTCGTGCC GGGCGCCTAC
ATGGTCCACG ACCAGTTGCA TGTGCGCGGC GGACACCAGG TTAGCTGGAT GATCGACGGC
ATTCCGGTGC CGAATACGAA CATCGCGAGC AATGTCGGAC CGCAGATCGA TCCCAAGGAC
ATCGATTATC TCGAGGTGCA ACGCGGCGAC TACTCGGCGG AGTTCGGGGA TCGCGCTTAC
GGTGTGTTCA ACGCGGTGAC GCATAGCGGC TTCGAGCGCA ATCGCGAGGG TGAACTGATC
CTCAACTACG GCAGCTACAA CCAGACCAAC AGTCAAATCA ATGCGGGCGA TCACACACAG
ACTTTCGCGT GGTATGCGAG CTTGAGTGGC AATCGCACCG ACGTTGGACT GGAGACACCG
ACGCCGGAAG TGCTGCACGA CATGAACAGC GGCGTGAGTG GGTTTCTTTC GCTGATCTGG
AACAAAGGAA ACCACGATCA ATTTCGCGTA GTGAGTTCCG CGCGCAACGA TTTTTACCAG
GTGCCGAATA CTCCGGAGCA GCAGGCAGCC GGGGTGGCGG ACACCGAGCG CGAACACGAT
GTGTTCCTGA ACGGCGCGTG GGTGCATACG ACGGCGTCGA ATACGGTCTT CACCTTGGCC
CCGTTCTATC ACTTGAATCA CGCTGCTTAC GATGGCAGCG CGGCGGATGA TCCGGTGATT
CCTGTTCAGG ACCGCACCTC GCAGTACGCG GGCGTGTTTG CAGCGGCGAC GCTGGTGAAG
GCTAAGAACA CGCTGCGGCT TGGCACGCAG ATGTATGCGC AGCACGACAA CGCATTCTTC
GGCGTAACGT GCAGCGAGAG CGGACTGACA GCCGAGACGT GCGGACCGGA TGCGGATCCG
CCATCGCCAA CCGCGGTGAA TGATCGCGTG ACGCCGTGGG GCGGCGTGGA AGCGGTGTTC
GCCGAAGACA CTTACAATCC GCTGCCGTGG CTGCGCGTGA ATGGCGGGCT GCGCTTCACG
CACTTCGATG GCGAGATCAG CGAGTCTTCG GTCGATCCGC GAGTGGGGGC GCAGATCAGG
CTGCCGCACT TGCAATGGGT GCTGCACGGG TTCTACGGGC GCTACTATCA GCCTCCACCG
CTGGCGACGG TCGGAGGGCC GATTCTCGAA CTTGCGAACC AGCAGGGCTT TGGCTTCCTG
CCGTTGAAGG GCGAGCGTGA CGAGCAGTGG GAAGCGGGCG TCTCCGTGCC GTTTCGCAAA
TGGCGCGGCG ATGTCAGCTA CTTCCAAACC AAAGCGAGGA ATTTCTTCGA CCACGATGTG
CTGGGGAACT CGAACATTTT TTTCCCGCTG ACGATCGCAC GGGCGCACAT CCATGGAACG
GAAGTTACAG TGAACTCGCC GACTGTATTC GGCCGCGCGC AGTGGCACCT GGTGTTCTCC
CGTCAGTGGG CGGAAGGATC GGGCGGGATA ACGGGTGGCC TCACCGATTT CTCACCGCCG
GAGGAGGGAA GTTTTTTCCT CGACCACGAT CAGCGCACGA CGATTGCGAC CGGGATGACG
GTGAACTTGC CGTGGCGGAC TTGGGTATCT GCGAATTTCG CATTCGGGTC CGGATTCCTT
TATGAGGATG GGCCGCAGCA TCTTGGGTCG AACAATACCG TGGACCTGGC AGTGACGAAG
TCGATCGGGG AGCGATGGAG TATCGGAGCT TCGTTCATCA ATGTGGCGGA TCACCGGTTC
CTGATTGACG CCGCCAACAC CTTCGGCGGG ACGCATTGGT CGGAGCCGCT GCAGGTGACG
GGGGAAGTGA AGTATCGATT TAAGTTCTGA
 
Protein sequence
MFKYRMIAAL IFVLAAIQSF ASVVGSVRGI VHDPQHRPLD SATITLTARG SDYRQTTTTN 
SAGEFAFAAV PAGEYEITVV ATNFRAAVQS VQVTASAAPV LHFQLAVATQ SQTVEVTDSG
EGLDVQPATG MVSSRDIART PGATRTNSLA MITNFVPGAY MVHDQLHVRG GHQVSWMIDG
IPVPNTNIAS NVGPQIDPKD IDYLEVQRGD YSAEFGDRAY GVFNAVTHSG FERNREGELI
LNYGSYNQTN SQINAGDHTQ TFAWYASLSG NRTDVGLETP TPEVLHDMNS GVSGFLSLIW
NKGNHDQFRV VSSARNDFYQ VPNTPEQQAA GVADTEREHD VFLNGAWVHT TASNTVFTLA
PFYHLNHAAY DGSAADDPVI PVQDRTSQYA GVFAAATLVK AKNTLRLGTQ MYAQHDNAFF
GVTCSESGLT AETCGPDADP PSPTAVNDRV TPWGGVEAVF AEDTYNPLPW LRVNGGLRFT
HFDGEISESS VDPRVGAQIR LPHLQWVLHG FYGRYYQPPP LATVGGPILE LANQQGFGFL
PLKGERDEQW EAGVSVPFRK WRGDVSYFQT KARNFFDHDV LGNSNIFFPL TIARAHIHGT
EVTVNSPTVF GRAQWHLVFS RQWAEGSGGI TGGLTDFSPP EEGSFFLDHD QRTTIATGMT
VNLPWRTWVS ANFAFGSGFL YEDGPQHLGS NNTVDLAVTK SIGERWSIGA SFINVADHRF
LIDAANTFGG THWSEPLQVT GEVKYRFKF
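
The two sequence blocks can be cross-checked by translating the gene sequence and comparing the result with the protein sequence. The sketch below assumes the standard codon assignments, which translation table 11 shares for all sense and stop codons. Table 11 differs in the start codons it permits, which this sketch does not model; this gene begins with ATG, so that does not matter here. The helper names `clean` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate in frame 1; unknown codons become 'X', a stop codon becomes '*'."""
    dna = clean(dna_text)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First and last rows of the gene sequence from this record.
first_row = "ATGTTCAAGT ACCGCATGAT TGCGGCGCTG ATATTCGTAT TGGCAGCAAT TCAGAGTTTT"
last_row = "GGGGAAGTGA AGTATCGATT TAAGTTCTGA"

print(translate(first_row))  # prints: MFKYRMIAALIFVLAAIQSF
print(translate(last_row))   # prints: GEVKYRFKF*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final residues followed by the stop codon. Translating `last_row` on its own is in frame only because every preceding row holds 60 bases, a multiple of 3.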