Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_4004 |
Symbol | |
ID | 4694641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 4394418 |
End bp | 4395437 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639851752 |
Product | hypothetical protein |
Protein accession | YP_998728 |
Protein GI | 121610921 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.506172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0371737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGCAT TCCGATCTTC CTCCCTCCAC CACCATCACC ATCACCACCA CCGGCGGCGG CTGTTGGCCG CACTGGGCAC CGCCGCGTGT CTTGCGGCCG CTGCCCCGGC AGGCCAGGCG CAAGGCTATC CGAACAAGCC GGTCCGGCTG GTCGTGGCGT ACCCGCCGGG CGGCGCAACG GATATCGTGG CGCGCGTGCT GGCGCAAAAG CTGTCGGAGC AGACCGGCCA GCAATTCATC GTCGACAACC GGCCCGGCGC GGGCGGCAAC ACGGGTGCCG AGTGGGCGGC GCGCAGCGCG CCCGATGGCT ACACGCTGGT GCTGGCAACC ACTGCGCATG CGATCAGCCC CGCGCTCTTC AAAAACCTCG GCTACAAGCT CGACAAAGAC TTTGCGCCCG TGTCGCAGCT CACCAGCGGC CCGCTCGTGA TCGTGGCGCA CCCCGGACTG CCAGCGAACG ACGTGACCGA GCTCATCGCG CTGGCCAAGG CCAGGCCGGG CGTGCTCAAT TTCGCGTCAT CGGGCAACGG CCAGTCGACC CACCTCGCGG CCGAACTGTT CGCCTCGATG GCCGGCGTGA AGATGGCGCA CATCCCGTAC AAGGGCAGCG CCCCGGCGCT GACCGACGTG ATGGGCGGCC AGGCGCAGCT GATGTTCGAC ACCATCCTCT CGGCCATGCC GCAGGTGAAG GCCGGCAAGC TCAAGGCGCT GGCCGTGACC AGCGCCAAGC GCTCGGGCGC GGCGCCCGAA CTGCCGACCG TGGCCGAGTC CGGCCTGCCG GGCTACGAGG CCATCGCCTG GAATGGCCTG CTGGCGCCGG CCGGCACGCC GCCGGAGGTG ATCGCGCGCC TGAACGCCGA ACTGAAGAAG GCGCTGGCGC TGCCCGAAGT GAAGGACAGG TTCGAGGCCC AGGGCTTTGC CGCCGCGTGG AACACGCCCG AGGCTTTCGG CGACTTCATG AACGCGCAGG TCAAGAAATG GGCGCAGGTG GTGCAGGTGT CGGGCGCCAC GCTGGACTAG
|
Protein sequence | MHAFRSSSLH HHHHHHHRRR LLAALGTAAC LAAAAPAGQA QGYPNKPVRL VVAYPPGGAT DIVARVLAQK LSEQTGQQFI VDNRPGAGGN TGAEWAARSA PDGYTLVLAT TAHAISPALF KNLGYKLDKD FAPVSQLTSG PLVIVAHPGL PANDVTELIA LAKARPGVLN FASSGNGQST HLAAELFASM AGVKMAHIPY KGSAPALTDV MGGQAQLMFD TILSAMPQVK AGKLKALAVT SAKRSGAAPE LPTVAESGLP GYEAIAWNGL LAPAGTPPEV IARLNAELKK ALALPEVKDR FEAQGFAAAW NTPEAFGDFM NAQVKKWAQV VQVSGATLD
|
| |