Gene Acid345_0784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0784 
Symbol 
ID4068565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp970813 
End bp974040 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content58% 
IMG OID637982791 
Producthypothetical protein 
Protein accessionYP_589863 
Protein GI94967815 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.410737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.275915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAG CTGTTCTCTT CACCTTCCTC ATCCTGCTGG TGATCTCTTC GATCGCTTTC 
GGCCAGCAAA CTACCGGACA AATCAACGGC ACCCTCGTAG ATTCCAGCAG TGCAGTCGTT
CCCAATGCGA CCGTTACCGC AAAGAACGTC GATAACGGAC TGACCCGCTC CACCAAGTCA
AGCACCACCG GCGGGTACAC CATAAATGAC CTTCCCCCTG GAACGTACAC CATCACCACG
GAAGCCCCTG GTTTTGCCAA GACCGTGAAT GAGCGCGTTC CGCTGCTCGT CGGCCAGGCC
TTGACGCTGA ACTTCACCCT CAAAACGGGC GGCGCCAATG AGACCATCAC CGTCACCGAG
GAAGCGCCGC TTATCGAATC GACGCGTTCC GATATCGGCG GCTCAGTATC GCCGTTGGAA
GTGAAGGAAC TGCCAATCGT GGACCGCAAC TTCGCGGGTT TGATGGCAAC CGTCCCCGGC
GTGCGTCCTG CAGAAGCCTT CGATCCGACC AAGACCCGCT CCGGCAACGT GAGCGTGAAC
GGCAGCGACG GCCGCTCCAT TGACTACAAC GTGGATGGCG GCGACAACAA GGACGTCGTC
ATCGGCGGCA TCGTCCAGAA CTTCACCATG GAAGGCATCC AGGAATTCCA GGTGACGACC
GACCGTTATA CGGCGGAATC CGGCCGCGCA GCGGCTGCTG TAGTGAACGT GATCAGCAAG
AGCGGTACCA ACGCTTTCCA CGGGACGGCG TTCAGCCTGT TCCAAAACAG CGGTCTGAAC
AGCAACAGCT ACTTCAATGA AATCGCCGGA AACCCGAAAA ACAAATTCCA TCGCTACCAG
TTTGGTGGTT CTGCCGGCGG TCCAATCATC AAGGACAAGC TGTTTTTCTT CGGCGCATAC
GAGCAGAAGC GCGAACCCCA GGACATTGGC GTCGATCCTT CTGCATTCGA CAATCTCACC
CTGTTCGCGG CTGCATTCCC GGACTATGCG GTTCCCATCA GGAAGCTCGA CTACTCGTAC
CTCGACCAGC AGCTGACAGC GAAGGTGGAC CATCGCATCA GCGATCGTCA GAACATGTTC
TATCGCTATG CGTGGGAAAA ATGGACGAAC CCCAACGATC AGCTTGGATT TCCGTTCGTG
GCTGACGCCA GCCAATCCAC TTCTGACAGC AACAGTTTTC ATGACTTTGT AGCGCAGCAC
AACTACACGA TCTCGCCGAC CAAGGTGAAT TCGTTCAACT TCCACCTCCA GGACTTCACC
AATGACATCC TTCCGGCGCC GGGTCGCACC TTCACCTACG ACGTGGCGGG AGGAGGGACT
GCAACCAATC CAGAAATCTG CTTCGGTATC GGCGGCGGAT GCGGCGGTGG CGTGCCGGAA
GTCGGCAACA ACGTCAACGT TCCCCAAGAA ACCCTGATTC GCAAATACCA GTTCCGCGAC
GACTTCACGT GGGTGCATCG CAACCACAAT ATGAAGATGG GCGTGAACTG GGATTACGTG
GACGTAATGG GCGGATTCTT CTACTTCGGC GCCAACGGCT ACCAAGTCAT CTTCCAGGAT
GATCCGAAGA CGATCCTGGC GAACCCTGCG GCGTATCCGG ACGGCTTCTC AACCCGCGGC
GCCGTCGGTG AACTCACCTA CAACGGCGGC TCCGGTTCGA CCGCACAGCC TCCATCGCAC
CAGTTAGCGT TCTACTTCCA GGATGATTGG AAGGTCACCA ACCGCTTCAC GCTGAACGCC
GGCGTGCGCT GGGACGCGAA CCCGCTGTTC CTCATCCCGC AGCTCACGAA CAACTTTAGC
AGCACGAACC GCACTGTCCG AGTCCTGCGC GACGTACTCG CCGCGAACCT GTCTGATCCA
GCGGCACAAG CAGGCGTTCA AAGGGCGGAC TACCTCGCCG GCAATACCAG TCTGGCTAGC
AAGAACACCG CCGACTGGAA GGAATTCCAG CCTCGTATTG GTTTCTCCTG GGATCCCACC
GGTTCGGGCA AGAGCGTCAT CCGCGGCGGC TATGGCATCG CACGCGACAC CATCTTCCAG
AACCTGACGC TCTTCGCGGT TCAGGAAACC AATCCAACCA TTTACAACAC GATCATCGAC
TACTTCCCGA GCCAAGCCCC GGGGTCTTGC CCTGCAGGCG GCACTGCCGA TCCAACCGAC
CTTTGCAACT TCCGCTTTGG CATCGATCCG CTTCCGGCTC CGCAAGCGGC GACTACGGAC
CTCGCACCCG GCGCCGTTGG CCGTATGCAG GACCCGCGTT TGACGGACCC GTGGTCGCAG
CAGATGTCCA TCGGCGGCGA ACGCCAGTTC GGCAACGACT ACGCGTTCGG CGCGGATTAC
TACCACGTGC TCGGCACCCA TGAACCACGC GTTCTGAACA TGAACCCGAA GATCGGATCG
ACCTGTGATC CGGCGTACGG CGGCGATCCT ACCAACCCAA CCTGCGTGAA CGGCGCGGGA
ACTCGGTTGA TGGACGCGGC CTTCTCGGAA GCACCAGACA GCCAGATCGC CGGCCAGAAC
CTTGGCATCG GGCGATTGGG CGCTATCTAC GATTACTCAA CCTCGAACCG CTCTCTGTAT
GACGGCATCA ACTTCCAATT ACGGAAGCGG ATGAGCCACC ACTTCCAATT CCAGGCGAGC
TACGTTCTCT CCTGGGCGCG GTCGTGGGGT GGACGCCCGA CGTCGTCCTA TAGCGGCAGC
GGCGTCAATG TCACTCCGGA GCAGCAGTTC GCTTCCAACG AATTCAACTA CAGCAGCTTT
GACGAGCGCC ATCGCTTCAC GTTGAGCGGC GTCTTCCAAC TGCCGTGGGG ATTCGAAGTT
GCACCACTGG TTCAGGCCGC ATCGGCACGC CCGTATGATT TCATCGCTGG TTCGGACATT
AACGGCGACG GACGTTCCAC GATTGACCGC GCTTGCGTCG GTAGCACTCC GGGCAATCCG
ATCTTCACTA AGGGTTGCAC CATGCTCAAG CCGGACACCC TGCGTGGCGA CCCGTTCTTC
CAGATTGATA CGCGCGTCGC TAAAGCATTC AAGTTCAACG AACACATGAC GTTGCAGTTG
ATTTGGGAGT TCTACAACAT CGGCAACGTA AACAACTTCT GTAACTACTA CTTCAATAAC
GCCAGCCAGT CCAACTTTGG AACGCCGCAG GGATACTGCG GTGGCCAGGG CGGTCCGGCC
TTTACGGGCC CGTTCCGTCA GCAGTTCGGC TTCCGTTTCG AGTTCTAG
 
Protein sequence
MKRAVLFTFL ILLVISSIAF GQQTTGQING TLVDSSSAVV PNATVTAKNV DNGLTRSTKS 
STTGGYTIND LPPGTYTITT EAPGFAKTVN ERVPLLVGQA LTLNFTLKTG GANETITVTE
EAPLIESTRS DIGGSVSPLE VKELPIVDRN FAGLMATVPG VRPAEAFDPT KTRSGNVSVN
GSDGRSIDYN VDGGDNKDVV IGGIVQNFTM EGIQEFQVTT DRYTAESGRA AAAVVNVISK
SGTNAFHGTA FSLFQNSGLN SNSYFNEIAG NPKNKFHRYQ FGGSAGGPII KDKLFFFGAY
EQKREPQDIG VDPSAFDNLT LFAAAFPDYA VPIRKLDYSY LDQQLTAKVD HRISDRQNMF
YRYAWEKWTN PNDQLGFPFV ADASQSTSDS NSFHDFVAQH NYTISPTKVN SFNFHLQDFT
NDILPAPGRT FTYDVAGGGT ATNPEICFGI GGGCGGGVPE VGNNVNVPQE TLIRKYQFRD
DFTWVHRNHN MKMGVNWDYV DVMGGFFYFG ANGYQVIFQD DPKTILANPA AYPDGFSTRG
AVGELTYNGG SGSTAQPPSH QLAFYFQDDW KVTNRFTLNA GVRWDANPLF LIPQLTNNFS
STNRTVRVLR DVLAANLSDP AAQAGVQRAD YLAGNTSLAS KNTADWKEFQ PRIGFSWDPT
GSGKSVIRGG YGIARDTIFQ NLTLFAVQET NPTIYNTIID YFPSQAPGSC PAGGTADPTD
LCNFRFGIDP LPAPQAATTD LAPGAVGRMQ DPRLTDPWSQ QMSIGGERQF GNDYAFGADY
YHVLGTHEPR VLNMNPKIGS TCDPAYGGDP TNPTCVNGAG TRLMDAAFSE APDSQIAGQN
LGIGRLGAIY DYSTSNRSLY DGINFQLRKR MSHHFQFQAS YVLSWARSWG GRPTSSYSGS
GVNVTPEQQF ASNEFNYSSF DERHRFTLSG VFQLPWGFEV APLVQAASAR PYDFIAGSDI
NGDGRSTIDR ACVGSTPGNP IFTKGCTMLK PDTLRGDPFF QIDTRVAKAF KFNEHMTLQL
IWEFYNIGNV NNFCNYYFNN ASQSNFGTPQ GYCGGQGGPA FTGPFRQQFG FRFEF