Gene Information |
Locus tag | Veis_3972 |
Symbol | |
ID | 4693949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 4356397 |
End bp | 4357812 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639851721 |
Product | ABC transporter nitrate-binding protein |
Protein accession | YP_998697 |
Protein GI | 121610890 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
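The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of length_bp bases."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4356397, 4357812)
print(length)                     # 1416
print(protein_length_aa(length))  # 471
```

Under these assumptions the computed values agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields of this record.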
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.338423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.174767 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAGAG AAATCGACAG CCCCCTCTTC CACACTGACG ACGCCGGGTA CGCAGGCAAC ATTTGCTCCT GCGGCAAACA TGCCAGCCAG ATGCAGATGG CGCATGGCGA TGGGCACCCG GCGGCGCAGC AACATCTGCG GCGAATACGT TCGCAGGATC CCGAACCATT GAGCAACGAT GTGGTGGAAG AATCCGTGTT GCGCGCACTG TTTCCGCAGG CAGCGCAGCG TCGCGGCTTC CTGCGTGCGG TCGGCGCCAA TACGGCGCGC GCGGCCATTG CCAGCATGTT TCCGTTGGAT GCGTTGCAGG CGATGGCGCA AGAAAAGAAC GGCGCCATCG AAAAGAAGGA CCTCAAAATC GGCTTCATCG CGCTTACCTG CGCAGCACCG CTGATCATGG CCGATCCGCT CGGCTTTTAC CGCAAGCAGG GCCTGAACGT GTCGCTCAAC AAAACCGCCG GTTGGGCGCT GATTCGCGAC AAGATGATCA ACAAGGAATA TGACGCATCG CATTTCCTGT CGCCGATGCC GCTGGCGATG TCGATCGGTG CGGGCAGCCA TCCGGTGCAG ATGCGCATCG CGACGATCCA GAACATCAAT GGCCAGGCCA TCACGCTGCA CGTCAAGCAC AAGGACAAAC GCAATCCCGG GCAGTGGAAT GGCTTCAGGT TTGCGGTGCC GTTCGAGTAT TCGATGCACA ACTTCCTGCT GCGCTATTAC CTTGCCGAAA ACGGACTCGA TCCGGATCGT GACGTGCAGA TCCGCGTCAC ACCGCCACCG GAAATGGTGG CCAATCTGCG CGCCGGCAAC ATCGACGGCT TCCTCGGTCC CGATCCGTTC AATCAGCGCG CGGTGTACGA CGAAGTCGGT TTCATCCATA TCCTCTCCAA GGAAATCTGG GACGGTCATC CATGCTGCGC GTTCGGCATG TCGGAAGATT TCGTCAAACA AAATCCGAAT ACCTTCGCGG CCTTGTTCCG CGCCGTGCTG ACTGCCGCAG CGATGGCGCG CGATCCGGCC AATCGTTCTC TGGTGGCCAA GGTGATTTCG CCGGCAGCCT ATTTGAATCA GCCGGAAACG GTGGTCGAGC AGGTGCTGAC CGGCCGCTTC GCTGACGGTC TCGGCAACGT GAAAACCGTG CCGGGCCGCG CCGACTTCGA TCCGGTGCCG TGGGATTCGA TGGCGGTGTG GATTTTGAGC CAGTTAAAGC GCTGGGGTTA TGTCAAAGGC GAGATCGATT ACAAAGGCAT CGCCGAGCGC GTGCTGCTGT TGACCGATGC CAAAAAATAC ATGAAGGAAC TCGGCCAGCC AGTGCCGGAC GGCGCCTATC GCAAACACAT GATCATGGGC AAGGAATTCG ACCCGGCCAA GGCCGACGCC TACGTCAACA GCTTTGCCAT CAAGAGGACG AGCTGA
|
Protein sequence | MPREIDSPLF HTDDAGYAGN ICSCGKHASQ MQMAHGDGHP AAQQHLRRIR SQDPEPLSND VVEESVLRAL FPQAAQRRGF LRAVGANTAR AAIASMFPLD ALQAMAQEKN GAIEKKDLKI GFIALTCAAP LIMADPLGFY RKQGLNVSLN KTAGWALIRD KMINKEYDAS HFLSPMPLAM SIGAGSHPVQ MRIATIQNIN GQAITLHVKH KDKRNPGQWN GFRFAVPFEY SMHNFLLRYY LAENGLDPDR DVQIRVTPPP EMVANLRAGN IDGFLGPDPF NQRAVYDEVG FIHILSKEIW DGHPCCAFGM SEDFVKQNPN TFAALFRAVL TAAAMARDPA NRSLVAKVIS PAAYLNQPET VVEQVLTGRF ADGLGNVKTV PGRADFDPVP WDSMAVWILS QLKRWGYVKG EIDYKGIAER VLLLTDAKKY MKELGQPVPD GAYRKHMIMG KEFDPAKADA YVNSFAIKRT S
|
| |
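The sequences above are printed in blocks of 10 characters. The following is a minimal sketch of how such a record might be cleaned up, its GC fraction computed, and its CDS translated. It assumes the amino-acid assignments of translation table 11 (which match the standard code; table 11 differs in its permitted start codons) and that the sequence starts with ATG, as this one does. The helper names are hypothetical, and only the first 30 bases of the gene are used here for brevity.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCCAAGAG AAATCGACAG CCCCCTCTTC"
print(translate(prefix))  # MPREIDSPLF
```

The translated prefix matches the first block of the protein sequence in this record. Applying `gc_content` to the full gene sequence would give a value to compare with the GC content field.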