Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DehaBAV1_0943 |
Symbol | |
ID | 5131672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides sp. BAV1 |
Kingdom | Bacteria |
Replicon accession | NC_009455 |
Strand | - |
Start bp | 928341 |
End bp | 930017 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640529867 |
Product | extracellular solute-binding protein |
Protein accession | YP_001214401 |
Protein GI | 147669583 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0446161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGGA AGTTATTATA CCTGCTGGCA GCCCTGTTAA TCATTGTGCC AATAGTGTTT AGCGGTTGTA CCAGTGACGA CAACGATGAC GACGGGGATG ACGGAACTGT CACCACTCCT CAGGTATTCC GTGTCAATCT GGCAGGTGAA CCCAACACCA TAGATCCCAA CAAGGCTTCT TGGGCTACGG AGAGATCTGT AATCATGCTA CTCTTTGTAG GTCTGCTGGA CTTTAACTCA GACCTGTCCC TAAAGGCAGC GTGTGCCCAG GAAATTCCCA CCGTGGCTAA CGGGGGTATT TCAGCAGATG GTTTGACCTA TACCTTCAAA GTCAAATCAA ACGTGACCTG GAGCGATGGC TCAAAGGTTA CCGCCCATGA CTTTGAGTAT AGCATCAAGA GAATGCTGGA TCCCGACACA GCTGCAGAAT ATGCATCTTT CTACTTTGAT ATTGTAGGTG CGGCGGCATT TAATGCTGCG GCCAGTGCTG ATGCAGCTAC CAAGACTGCC CTGCGGAATG CTGTGGGGGT GACAGCGGTA GACGATACCA GTCTGCGCAT TACCCTGAAT CAGACCCGCC CCACCTTCCT GTCTATTATG GCTCTTTGGC CTACCTCACC GGTAAAGGAA AGCGTTATTA CTGCCAAGGG TAATTTGTGG ACTGAAGCCG GCAACCTTAT AGGTAACGGG CCGTATACCC TGAAAGAATG GGTGCATCAG GACCACATGA CGTTCACTCT CAATACCAAT TATTGGGATA CCAAGCCCAC CCTGACTGAG ATTAAGTACC TGATGATTCT GGATGCAACC CAGGAATTAT CGGCTTACAA GAATGGTGAG CTTGATATGG CCAGAGTCCC GGTAGGCACG GAAACAGCTA CTTTGGCTGA TCCGGTTTAC GGTAAGCAAG TGGTACGGAA CAATGACCTT ACTACTTTTG CTTTCCAGTT CAATGTTAAT AAAGCCCCCT TTGATAATCT GCTGGTACGC GAGGCTATGT CCTGTGCCAT TGACCGGGTG GCCTTTGTGG AGCAGGTTAG AGGCGGGGTA GGTACTCCGG CTTATTCGTG GATTCCGCCC GGCATGCCTG GTTACGATGC TGATTTGGGC AAAGATTTTG CTTTTAATGT CACCAAGGCC AAACAGCTTT TGGCTGATGC CGGCTATCCT AATGGGGTTG GCATGCCCGA ACTCAAATTC CAGTATGCAG ATACAGCGAG CAACCGAACT ATTGCCCAGT TCCTGCAGGC TCAGCTAAAG ACCAATTTAA ATCTTGATCT TACCCTTGAG CCTATGGAAC CGGCAGCCTT TAGTGCCTTT GTGAACAGCG AACAGCATAC TTGGGCCTGG TTTGGCTGGG GTGCTGACTA TCCTGATCCG GACAACTGGC TACCTGACCT GTTCGGAACC GGTGGTGGCA ACAACCATAC CGGATATTCA AATCCCGCTT TTGATACGCT GGCCAGACAA GCCATGATGG AACTTGATAA TACTCTGCGC CTTCAAATGT GGGCTCAGGC TCAGGAAATT GTTATGGCAG ATATGCCCAT TGTAACCATG TTCTACCGTG AACGGTTCTA TGTTGTACAG CCGTATGTTA AAGGGCTGGA ACCAACCGGT ATGGATGGCG GCATAATGGG TGATACTTCA TTGGTCAATG TCTCTATTGT GAAGTAG
|
Protein sequence | MKGKLLYLLA ALLIIVPIVF SGCTSDDNDD DGDDGTVTTP QVFRVNLAGE PNTIDPNKAS WATERSVIML LFVGLLDFNS DLSLKAACAQ EIPTVANGGI SADGLTYTFK VKSNVTWSDG SKVTAHDFEY SIKRMLDPDT AAEYASFYFD IVGAAAFNAA ASADAATKTA LRNAVGVTAV DDTSLRITLN QTRPTFLSIM ALWPTSPVKE SVITAKGNLW TEAGNLIGNG PYTLKEWVHQ DHMTFTLNTN YWDTKPTLTE IKYLMILDAT QELSAYKNGE LDMARVPVGT ETATLADPVY GKQVVRNNDL TTFAFQFNVN KAPFDNLLVR EAMSCAIDRV AFVEQVRGGV GTPAYSWIPP GMPGYDADLG KDFAFNVTKA KQLLADAGYP NGVGMPELKF QYADTASNRT IAQFLQAQLK TNLNLDLTLE PMEPAAFSAF VNSEQHTWAW FGWGADYPDP DNWLPDLFGT GGGNNHTGYS NPAFDTLARQ AMMELDNTLR LQMWAQAQEI VMADMPIVTM FYRERFYVVQ PYVKGLEPTG MDGGIMGDTS LVNVSIVK
|
| |