Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_3631 |
Symbol | |
ID | 4693660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 4014883 |
End bp | 4017738 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639851386 |
Product | hypothetical protein |
Protein accession | YP_998365 |
Protein GI | 121610558 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCTC CCCGGTGTCG CCCTGACCAG GGATCGCAGC GTCAGCCGGT CTGGCACCAT GGCTTGCCCT CAGGCAATGA GTTCGGGGGC GCCCTCGTCC ATCCACTGGA TACCATGTTT TCATTTTTTC CCCGCAGCAA AATGTCCGAA GCACCTGAAG TGAATCCGGC CCCGGCCAAA ACCCACGAGC CGCAGCCACT CGATGCGCTG ACCGGTGGCG CCTTTTCCGC CGCCACCTCC GGCGAGCGCG CCGCCCGCAT TCGCGACTGG CTGCTCACCG AGCCCGCGCC GGAACAATTG CAGGAAGTCT TCAAGGAACT GAGCCGGCGT GACAAGGGCG CCGCCCGTGC AGTGCGCGAG CGCCTGGACG AAATCCGCCG CGCCAAGGGG CAGCAAGCCA TCGCTGCCGA ATGGGCGGAA AAGGCCCAGG CCCTGCTGGC GACCCCCAAG CTCAACATTG CCGATGCGAT GGCCTGGCAG CGCGATGCCG CCAAGGCCGG CGCCCCGCTG TCGCGCGAGC CGCTGTCGCT GCTCAAGATA CAGGTGGCGC AGCGGGTGCA GGTCATCGAA GACCTGCAGC ACCGCGTGCA GGTGCAGCGC GAAGCGGCGG TGCTGCTGGC CCAGCGCATC GAGGTGCTGT CCACCAAGCC CTGGCGCGAT GCGCTGGCCG CGCTCGACGC GCTGCGCGCC GATGTCGCCC ACTGGCAGGA GCAGGCCCTG CAACTGAGCA GCGATGCCAG TTGGGCCAGC GTGGACCTGC GCTTTCCGCC GCTGCTGGAG GCATCGCGCA GCCAACTGCT GGTGGTGTGG GAGGCGTTCC AGCCGGCGCT GGCCCAGGCC GCTGCGGCCG CCGAAGACGC CGCTGCGCCG CTGCCGCCAG TGCCCGTGTG GGCCGACGAG TTGCGCGCCG CGCGCGGCTT GCTCGCCGAG GCCGCTGCCA AACCCGCCCG CGCCGCCCGG CCCAAGCCCG ACCCCGAAGT GCTGGCGCAA GCGACCGAAC TGGTGCGCGC AGCCCTGGCG ACGCTGGAAA AGGCAACCGC CGAAGGCCAT GGCAAGGCCA GCGCCGGCGC CGCAGCGGCG CTGCGCGCCG TGCTCAAGGC GCAGGGCAAG CATATCGATG ACAAACTGGA GCAGCAGGTG CATGCCGCGC TGGTGGCAGC CGGCGAACTC GAAGGCTGGC AGCGCTGGAG CACCGACCAA GTGCGCGAAG ACCTGGTGGC CAAGGCCGAG GCGCTGCTCA CCCGCCCCGA AGGCCAGACC CTGGGCGGAC GCAAGATGCA GGAAACCCTG CGCCATCTGC GCGACCAATG GAAACAGGCC GACCAAGGCG GCGCGCCGAA CCACGCGCTG TGGAAGCGGT TCGACGAGGC TTGCAACGCC GCGCACAAGG TGGTCGAGCT CTGGCTGGAC AATATCCGCA GCGAAGCCGC CGAGCACAAG GCGCAGCGCC TGGCCCTGAT CGAAGAGCTC AAGGACTGGG CGCAGCAGCC GGCCGCCTCG GGGGATTGGA AAGCCGTCCA CCGCGCGCTG CAGCAGTTTG GCGAGCGCTG GCGCGAAGGC GGCCATGTGG GCGAGAAGAT CTTCGCCGAG TTGCAGCCGC TGTGGAAGCA GGCCCTGGCC CTGGCGGCCG CGCCGTTGGA AGCGGCGCAA AAAGAGAGCC TGGCCCGGCG CCAGGCGATG ATCGAAGAGG CCAACGCGCT GGGCGCCGCA GCCACGCTGC GCATCGACGC CGTCAAGGCC CTGCAGCAGC GCTGGCAGGC CGAAGCGCAG ACCGTGCCGC TGGAGCGCAA GCATGAGCAA AAGCTCTGGG AGGCTTTCCG CAAACCGTTG GACGAAGCCT TCCAGCGCAA GTCGACCGAG CGCGAGCGCG CCGTCTCCGA ACTGAGCGCG CGCGACCGCA TGGTCCTCGA CGCGGCCAAT GCCCTGCAAG CGGCCAATGC CAGCGCTGAC GCGCAGCAGA TTCGCGCTGC GATGCAGGCG CTCGAAGCCG CCTTGCGCGG CCAGGCCCAG GCGGCGCAAG TGCTGGCCTC GGCGCCCAAG GATGCACAAG ATCAGGTACA GGTACAGGTA CAGGTGCAGG TGCAGGAAGA ATCTGCACAG GATGCGACGA AAAGCGTAGC CCATCAGACA CCGGAAGAAG GGGCTGACCC GGCCAGTGCG GCGGCGGCGC CTGCCGCCGC GCCCAAGCCT GCGCCACGCC CGGTGGTGGC CGTGCGTGGC GATGACCGCC CCGGCATGAA GAAGGACATG CTGCCCGCCC CGGGCCGGGG TGGGCGTCCC GGCGAGCGCC GCGCCGACCG GCCGGGAGAG CGCGACCGCA ATCCGCGTGC CGATGGCCGC AGGGGCGAGC GCAGCGAGCG TGCTGGCCAC GGCGCCGACC GCCCGCTGCC CGAAGAGCGC GGTCCGCGCC TGGGCGACAC CGCTTTCCGC GCCCAGCGCG ACGCGATGGA GCAGGCCCAA CTGGCGCTGA AGAAACTGGC CGCGTTGGCC CATGGCGAGG CGCTGACCCA GTTGCTCACG GCCTGGCAAC AGCGCGACGC AACCCAGGTG CCCGGCACGC AGGAGCTTGG CGTGCCGGCC GCGATGCGCA GCGCCTGGAC GCAAGCGCTG TCAGCGCCAC CCCGGGGCGA TGCGTCCGAA GCGCTGCTGC GCCTGGAGAT AGCGGCACAA ACGCCCACCC CCGCCGAGCA CATCGACGCG CGCCGGATGT TGCAACTGCA ATTGCTCACG CGCCGCAACG ACCCGGGCCC CGCCCAGACC TGGGGCCAGG ACACGGCCCT GGTGCTGGCC AGTGCCAACG ACGCCGCCAA CACCCACCGG CTGCAACGGG CGTTGAAGGC ATTGCTGCGC AAGTAG
|
Protein sequence | MGAPRCRPDQ GSQRQPVWHH GLPSGNEFGG ALVHPLDTMF SFFPRSKMSE APEVNPAPAK THEPQPLDAL TGGAFSAATS GERAARIRDW LLTEPAPEQL QEVFKELSRR DKGAARAVRE RLDEIRRAKG QQAIAAEWAE KAQALLATPK LNIADAMAWQ RDAAKAGAPL SREPLSLLKI QVAQRVQVIE DLQHRVQVQR EAAVLLAQRI EVLSTKPWRD ALAALDALRA DVAHWQEQAL QLSSDASWAS VDLRFPPLLE ASRSQLLVVW EAFQPALAQA AAAAEDAAAP LPPVPVWADE LRAARGLLAE AAAKPARAAR PKPDPEVLAQ ATELVRAALA TLEKATAEGH GKASAGAAAA LRAVLKAQGK HIDDKLEQQV HAALVAAGEL EGWQRWSTDQ VREDLVAKAE ALLTRPEGQT LGGRKMQETL RHLRDQWKQA DQGGAPNHAL WKRFDEACNA AHKVVELWLD NIRSEAAEHK AQRLALIEEL KDWAQQPAAS GDWKAVHRAL QQFGERWREG GHVGEKIFAE LQPLWKQALA LAAAPLEAAQ KESLARRQAM IEEANALGAA ATLRIDAVKA LQQRWQAEAQ TVPLERKHEQ KLWEAFRKPL DEAFQRKSTE RERAVSELSA RDRMVLDAAN ALQAANASAD AQQIRAAMQA LEAALRGQAQ AAQVLASAPK DAQDQVQVQV QVQVQEESAQ DATKSVAHQT PEEGADPASA AAAPAAAPKP APRPVVAVRG DDRPGMKKDM LPAPGRGGRP GERRADRPGE RDRNPRADGR RGERSERAGH GADRPLPEER GPRLGDTAFR AQRDAMEQAQ LALKKLAALA HGEALTQLLT AWQQRDATQV PGTQELGVPA AMRSAWTQAL SAPPRGDASE ALLRLEIAAQ TPTPAEHIDA RRMLQLQLLT RRNDPGPAQT WGQDTALVLA SANDAANTHR LQRALKALLR K
|
| |