Gene Acid345_0315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0315 
Symbol 
ID4068592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp338261 
End bp341638 
Gene Length3378 bp 
Protein Length1125 aa 
Translation table11 
GC content57% 
IMG OID637982318 
Producthypothetical protein 
Protein accessionYP_589394 
Protein GI94967346 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.269975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.693881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC TGAAGTCTGT TCTCCTGATC GCAATTGTCA TCCTATGCAG CGGGTTCGCC 
GCGGCCCAAG GCTATCTCAG TACCATCAAC GGCACAGTGG CCGACAAAAG CGGTGCTGTC
ATTCCGAACG CCAAGGTCAC CGTCACGAAC ATCGCCAAGG GCCTCAGCCA CACCTATACA
ACCAGTCAGA CTGGAACCTA CACGGCCCCC AGCCTCGAAC CCGGCGACTA CACCGTGACC
GCGGAAGCCC AGGGATTCGG AACCTTATCC AAAGAAAGTC TCCATCTCGA AGTTGCGCAC
GATCTTCGGA TTGACTTTCA ACTCTTGCCC GGCGTCGCAA AAGAAACGGT GGCGGTGGAG
GAAGTAGCGC CTCTGATCAA CACTTCCAAC GCCGTTTTGG AAAGCACGTT CACCAACGAG
CAAATCAATC AACTGCCGCT GCAGGGTCGC GACTTCCAGA ACCTGGTCGT GTTGAACCCT
GGCGTGCAGC GTCGCCCGGG CGGCGGATTC CTCTCGATTT CTTCGAATGG CAACCGCGTT
GAAGACAACA ACTTCATCGT CGACGGCATT GACGACAACG ACGCCTACTA CGGCACGACC
GTGATCAACG CCGAGGGCGT CGAAGGCACG CCGGCAACGC ACCTGCCGAT CGACGCGATC
CAGGAGTTCA ACGTTCAATC CAGTCCTGAA GCCGAGTACG GCTGGAAACC CGGCGCCATC
GTGAACATCG GCATCAAGTC CGGCACCAAC GCGCTCCACG GTACTGCGTA CTACTTCCAT
CGCAACAGCG CGCTGGATGC CCGGAACTAT TTCAATCCCG GCGATCAACC CGTTTCCGCC
CTTCGACTGC ACCAGTTCGG CGGATCCATC GGCGGACCGA TCAAGAAGGA CAAGCTGTTC
TTCTTTGCGA ACTACGAAGG CGTTCGCGAC GTGGTGGGAA ATCCACTTGA GTTGAACACG
CCTGTTACCG TTCCGTACGG TGACCCCTCC ATAAGCATTC CAGACGCGGT TGCCGCCTGC
ACGGCGGACG GAAGTTGCAA TGACCTCAGC TCCAGTTTGC TGAAGTACTT CCCCACGAAC
CCCGGCACGG CAAATCCGGC CGATCCGACG CTCCAGACGT TCGACTTCAA CAATACCAAC
CGCGAAGACA ACGGCATTGT GAAGGTTGAC TACCAGATCA ACGAGCACAA CAACGCCAGT
TTCAGGTATT TCATCGGCGA CAGTTTGCAG GTGGAAGAAG CGCACAACGT GCTGCAACCT
CAGTGGCTGG ACCAGGCGCA GACACGCGCC CAGGTCATCG GAGCCAGCTA TAACTGGATG
CCCAAAAACA ACGTCGTCAC GCAGTGGCGG TTCGGATTCA ATCGCTTCTC ACAGCAAATC
TTCACGGTCG ATCACACCGT TAATCCCACC GAGTATGGAA TCAATTCTGG TGTAACCGAT
CCGCTCGATT TCGGGTTCCC GGAAATTGCG ATCAGTGGAT TCAACCGGCT TGGAGGCTCT
TCAAATTGGC CGCTCTTTAC TACCCCCAAC TTGACCTACC AAATCACGGA AAACACCACG
TGGAGCCACG GTAACCATTC TTTTAAGTTC GGTGGCGAAT TTCGCACTGG CAGTACCGAC
AACCTTCGCG ATCTCGATGG CAAGGGCTAC GTAGCCTTCC GCGGCCTCGA ACGATTCACC
ACCGGCGACT ATCGCTACGG CTACATCAAA GCGGGAGACA CCCACCGCGT CGTGAGTCAG
AAGTCATTTG GCTTCTATGT GCAGGATGAC TGGCGCATCA AGCCTCGCTT CACTGTCACC
GGCGGTCTGC GTTACGACGT CAGCCTTCCG ATTACCGAAC AGAACGACCT CCTGGCCAAT
TTCGATCCTG CCGTTGGAAT CCAGCAGGTT GGCAAGCAGA TCGACAAGCC CTACAACACC
GACTGGAACA ACTTTGCTCC GCGCATCGGC TTCGCTTGGG ACCTCTTTGG TACCGGCAAA
ACCGTACTCC GCGCCGGTAG CGGCATCATC TACGAGATCC CACACATCTC TCTGTTCGTT
TCGCAGAACT CTGCATCCGC ACTCGGACTG GCGAATATTC CTTCCGGCGC GAACGGCATC
TCCGTTCCTG GCGGCAACAT CACTACCGCG GATTTAAAAG TGAGTGACCT CAACTGGTCG
TCTGCTGGCC CCATCTTCGG CGACATTCCT TCTACGAAAC TCACCTGCGA CAGCGATACA
CCGTGCTCCA TCCTCGGCGT GGACAAGAAC ATCAAGACGC CGTACTCGTT GAGCTGGAAC
CTGAACCTGG AACAGCAACT CACCCGCAAT ATGGCGTTGC AGGTCGCCTA CGTCGGCAAT
CACGGCGTGA AGTTGTACAG CATTCGCAAC ATCAACCAGA ACCAGTGGTA CTACGACTGG
GTTGCCAACG GTATGGGCGA TCCCTACGAA GACACCGACG GCTATTCCGG ACGTCCATTC
GCCAACCCGA CCCTCTGCGG CACTCACTGC TTCCCCTACC TGGCCGAAGT TGAGATGCTG
CAGAACAAGG ACAGTTCGAT CTATCACAGC CTGCAGGTAA CGTTGCGGCA GCACAGCTAC
AAAGGCATCG ACTTCGTGGC TGGCTATACA TGGGCGCACG CCATTGACCA GTACGGCAGC
AACCGCTCGT ACGATTGGGA AGACCCGAAC AACGGCCTTC TTGAGCGCGG CGACGCCAGT
TCCGACATCC GGAATCGCTT CACCCTTGCC TTCACCTACC AGACGCCTAA ACTTCCCCGC
TATGACAAAG CATTCGGCGG CTGGACTTTC AACACTATCC TCAACTTCGA AGGCGGGCGT
CCCATCGATC TCTATGACGG CGACAGCGCG TTCAGCGGAG CTTACACCGG CAATGATCGG
TGGAGTATTT ATGGGGATCC GTCGAATTTG AAGTGGGGCT TCAATGGGCT TCCGTTCATC
CCCGTTGGAA CTCCTCAGTG GGCGAAGACG TGTGGACCGT ATGCACCTGG TCTGCTTCCC
GGAATCGATC CTGGCACCGA CTATCCGACG ACGTCGGATT ACGCGGGCGG ATGCTACGCG
GAGAATGGCA CCGTGCTGGT TCCGGCACAG TGGGGACAGT TCGGCAATAT GCACCGGAAC
ATGTTCCGCG GACCGGGCTA CTCGGATGTT GATATGTCGC TCGCCAAAAC CTTCAAGTTG
ACTGAACGCT TCAGCCTGCA AGTTCGCGGC GAGTTCTTCA ACATCCTCAA CCATCCGAAC
TTCGCCGGAC TCGGAGCGAA CCTCGAAACG GGTGCAGTCG GAATTCCGGC TTACACACCC
GATGTCGCGG AATCGAATCC AGTCGTTGGT TCCGGTGGAA GCCGCCACAT TCAGGTTGGT
GCAAAGATCG TCTTCTAG
 
Protein sequence
MKRLKSVLLI AIVILCSGFA AAQGYLSTIN GTVADKSGAV IPNAKVTVTN IAKGLSHTYT 
TSQTGTYTAP SLEPGDYTVT AEAQGFGTLS KESLHLEVAH DLRIDFQLLP GVAKETVAVE
EVAPLINTSN AVLESTFTNE QINQLPLQGR DFQNLVVLNP GVQRRPGGGF LSISSNGNRV
EDNNFIVDGI DDNDAYYGTT VINAEGVEGT PATHLPIDAI QEFNVQSSPE AEYGWKPGAI
VNIGIKSGTN ALHGTAYYFH RNSALDARNY FNPGDQPVSA LRLHQFGGSI GGPIKKDKLF
FFANYEGVRD VVGNPLELNT PVTVPYGDPS ISIPDAVAAC TADGSCNDLS SSLLKYFPTN
PGTANPADPT LQTFDFNNTN REDNGIVKVD YQINEHNNAS FRYFIGDSLQ VEEAHNVLQP
QWLDQAQTRA QVIGASYNWM PKNNVVTQWR FGFNRFSQQI FTVDHTVNPT EYGINSGVTD
PLDFGFPEIA ISGFNRLGGS SNWPLFTTPN LTYQITENTT WSHGNHSFKF GGEFRTGSTD
NLRDLDGKGY VAFRGLERFT TGDYRYGYIK AGDTHRVVSQ KSFGFYVQDD WRIKPRFTVT
GGLRYDVSLP ITEQNDLLAN FDPAVGIQQV GKQIDKPYNT DWNNFAPRIG FAWDLFGTGK
TVLRAGSGII YEIPHISLFV SQNSASALGL ANIPSGANGI SVPGGNITTA DLKVSDLNWS
SAGPIFGDIP STKLTCDSDT PCSILGVDKN IKTPYSLSWN LNLEQQLTRN MALQVAYVGN
HGVKLYSIRN INQNQWYYDW VANGMGDPYE DTDGYSGRPF ANPTLCGTHC FPYLAEVEML
QNKDSSIYHS LQVTLRQHSY KGIDFVAGYT WAHAIDQYGS NRSYDWEDPN NGLLERGDAS
SDIRNRFTLA FTYQTPKLPR YDKAFGGWTF NTILNFEGGR PIDLYDGDSA FSGAYTGNDR
WSIYGDPSNL KWGFNGLPFI PVGTPQWAKT CGPYAPGLLP GIDPGTDYPT TSDYAGGCYA
ENGTVLVPAQ WGQFGNMHRN MFRGPGYSDV DMSLAKTFKL TERFSLQVRG EFFNILNHPN
FAGLGANLET GAVGIPAYTP DVAESNPVVG SGGSRHIQVG AKIVF