Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0028 |
Symbol | irgA |
ID | 5137888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 22921 |
End bp | 24879 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640531488 |
Product | enterobactin receptor protein |
Protein accession | YP_001216002 |
Protein GI | 147674899 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000507235 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGAT TCAATCCATC CCCCGTCAGT TTATCTGTGA CACTAGGCTT AATGTTTTCG GCTAGCGCTT TTGCTCAAGA CGCGACGAAA ACGGATGAAA CCATGGTGGT CACTGCGGCG GGATACGCGC AAGTGATTCA AAATGCACCA GCCAGTATCA GTGTGATTTC AAGAGAAGAT CTGGAATCTC GCTATTACCG TGATGTGACC GATGCGCTAA AAAGCGTACC GGGTGTGACA GTCACCGGAG GGGGCGATAC TACCGATATC AGCATTCGTG GTATGGGATC AAACTATACT CTTATCTTGG TGGATGGTAA GCGCCAAACC TCACGCCAGA CCCGTCCAAA CAGCGATGGC CCGGGCATTG AGCAAGGTTG GTTACCGCCA CTGCAAGCGA TTGAACGTAT CGAGGTGATC CGTGGCCCGA TGTCTACGCT GTACGGCTCG GATGCGATTG GCGGCGTGAT CAACATCATC ACTCGTAAAG ATCAGCAGCA GTGGTCAGGC AATGTGCAGC TATCGACCGT GGTACAAGAA AATCGCGCCT CAGGCGATGA ACAAAGCGCT AATTTCTTCG TAACAGGACC TTTAAGTGAT GCTCTATCGC TGCAAGTCTA CGGACAAACC ACGCAACGCG ATGAAGACGA AATTGAGCAT GGTTACGGCG ATAAATCACT GCGCAGCTTA ACCTCAAAGC TCAACTACCA GCTTAATCCA GATCATCAAC TGCAACTGGA AGCCGGTGTA TCAGCCCAAG ATCGCGAAAA TAACGTGGGC AAGTCGGCGC AATCCAGCGG ATGTCGCGGT ACATGCAGCA ATACCGATAA CCAGTACCGT CGCAACCATG TTGCCGTCTC TCACCAAGGG GATTGGCAGG GTGTTGGGCA GTCTGATACT TATCTGCAAT ATGAAGAGAA CACCAATAAA TCTCGTGAGA TGAGCATTGA TAATACGGTG TTCAAATCCA CGTTAGTGGC TCCTATTGGT GAGCATATGT TGAGCTTTGG TGTGGAAGGT AAACACGAAA GTTTAGAGGA TAAAACCTCC AACAAAATTT CGTCTCGCAC GCATATCTCG AACACTCAAT GGGCGGGATT CATTGAAGAT GAATGGGCGC TCGCAGAGCA ATTCCGCCTG ACCTTTGGTG GCCGTCTGGA TCATGATAAA AACTACGGCA GCCACTTCAG CCCACGTGTG TATGGCGTAT GGAATCTCGA TCCTCTATGG ACAGTAAAAG GTGGTGTTTC TACTGGCTTC CGTGCACCGC AATTACGTGA AGTGACGCCG GATTGGGGGC AAGTTAGTGG CGGCGGTAAT ATTTACGGCA ACCCGGATTT ACAGCCTGAA ACTTCGATTA ACAAAGAACT GAGTTTGATG TACAGCACTG GCTCAGGTCT GGCGGCATCG TTAACCGCTT TCCATAATGA TTTTAAAGAT AAAATCACCC GCGTCGCTTG CCCTGCAAAT ATTTGTACTG CCGGCCCCAA CCAATGGGGT GCAACTCCTA CTTATCGTGT GAATATTGAT GAAGCGGAAA CCTATGGTGC AGAAGCTACG TTAAGTCTGC CGATTACCGA AAGCGTTGAG TTATCGTCTA GCTACACGTA CACCCATTCG GAGCAAAAAT CAGGGAATTT TGCTGGCCGT CCGCTACTGC AGTTACCTAA ACATCTGTTC AATGCCAACT TGAGCTGGCA AACCACAGAT CGCCTCAATA GCTGGGCTAA CCTCAACTAT CGTGGTAAGG AGATGCAGCC GGAGGGTGGG GCATCGAACG ATGATTTCAT CGCGCCAAGC TACACCTTTA TCGATACTGG CGTGACTTAC GCACTCACCG ATACCGCAAC GATTAAAGCT GCGGTGTACA ACCTGTTTGA TCAAGAGGTG AATTACGCGG AGTATGGCTA TGTAGAAGAT GGCCGCCGCT ACTGGTTGGG TTTAGACATC GCCTTCTAA
|
Protein sequence | MSRFNPSPVS LSVTLGLMFS ASAFAQDATK TDETMVVTAA GYAQVIQNAP ASISVISRED LESRYYRDVT DALKSVPGVT VTGGGDTTDI SIRGMGSNYT LILVDGKRQT SRQTRPNSDG PGIEQGWLPP LQAIERIEVI RGPMSTLYGS DAIGGVINII TRKDQQQWSG NVQLSTVVQE NRASGDEQSA NFFVTGPLSD ALSLQVYGQT TQRDEDEIEH GYGDKSLRSL TSKLNYQLNP DHQLQLEAGV SAQDRENNVG KSAQSSGCRG TCSNTDNQYR RNHVAVSHQG DWQGVGQSDT YLQYEENTNK SREMSIDNTV FKSTLVAPIG EHMLSFGVEG KHESLEDKTS NKISSRTHIS NTQWAGFIED EWALAEQFRL TFGGRLDHDK NYGSHFSPRV YGVWNLDPLW TVKGGVSTGF RAPQLREVTP DWGQVSGGGN IYGNPDLQPE TSINKELSLM YSTGSGLAAS LTAFHNDFKD KITRVACPAN ICTAGPNQWG ATPTYRVNID EAETYGAEAT LSLPITESVE LSSSYTYTHS EQKSGNFAGR PLLQLPKHLF NANLSWQTTD RLNSWANLNY RGKEMQPEGG ASNDDFIAPS YTFIDTGVTY ALTDTATIKA AVYNLFDQEV NYAEYGYVED GRRYWLGLDI AF
|
| |