Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VIBHAR_04996 |
Symbol | |
ID | 5558047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio harveyi ATCC BAA-1116 |
Kingdom | Bacteria |
Replicon accession | NC_009784 |
Strand | + |
Start bp | 249987 |
End bp | 251417 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640909474 |
Product | cysteine desulfurase |
Protein accession | YP_001447130 |
Protein GI | 156976224 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAC TTCCGACAAG GATCGAAAAG ATGAGCGACA AGGAATCTAT TCAGACTAAT CGCAGCCGCC GCAATTTTTT AAAAGGTGCA ACTGGTTCAG TTGTGGCTGG CGTCAGTGCT TCAGCGTTTT CTGGTGCTGT TCAAGCAAAG GAACCGAATA TCGACTGGAG CCGTCTAGGT AAAAAGAACG AGAAGCGCTT TTGGCGTAAA GTTCAAAAGC AGTTTGTATT GGATAAACGA ACGACTTACA TGAATATTGG TACGACGGGT TCGATGCCAA AACATGTTCT TGAAGGATAC GAAGACAACA ACAAAATCGT TGCTAAGTAC CCGTGGGATA TGAAAGACAA GTTTGGCGCT TGGCCTCATG TTTCTGAAAT GGTAACAGAC GTTGCACCAG GCTTTGGTGC TAATCCAGAC GAAATCATTT TGAGTCGTAA CACTACCGAC GGTTTATGTT CAATCATCAA CGGTCTGCAT TTCGAGCCTG GTGATGTAAT CCTAACTACT CACCACGAAC ACGTTGCCGC AACGTCACCA ATGAATGTGG CAAAACACCG CTTCGGTGTC GATGTCGTTG AAATTCAACT TCCTGTTTTC ACTGGCTCGG AAGAGGTTTC AGAAGAAGAT TACATTCAAG CCTTCCGTGA AGCGATTGAA GCGCACCACA ATGTTCGCCT CATCGTGTTC TCTCAGATTA CTTACAAAAC AGGTACAACG CTGCCAGCCA AAGCGATTTG TTCTCTAGCA AAACAGCACG GTATCCCAAC ATTAGTTGAT GGTGCGCACA CTGTCGGTAT GTTCGACTTA GACTTCCACG ATATGGATTG TGACTTCTAT GCAGGATCTG GCCATAAGTG GCAGTGTGGT CCGGGCGCGA CAGGTATCTT GTATGTACGT GATAACGGCA ACCGTTTGAA TGAGTACTGG AGCGATCGTG AAAATCCACT GTGGTTGATC AATTCTTCTC TTTCTCACGC TGATCATCTA GGCAAGCAAT TGCAAATGCA ATACATTGGT AACGATAACT ACCCGGCAAA ACAAGCACTT GCTGATAGCT GTAAGATGTG GGATGAGATT GGCCGTGACC GCATTCAAGA GCGTGTACTA GAGCTGAGCG ATCTATGTAA GACGCTGCTT AACGAAGCGT TACCACATGC TCAGATGTAT TCGCCAAACG TGGAAGGCTT AACCAGTGGT CTAACAACGT TTAACCCATT AAGCGATGTG ACAGATAAAG AGCGATTGAC TCTGTTCCGT GACCGTCTTC GTGAAGAATA CGGATACATC ATCCGTACAA CAAACTTCAA GTTGTACAAA GACGACGCTT ACGAAACGCA AGCACTGCGC ATCTCGACTC ATTTATTCCA TGATGAGAAA GATGTAGAAG GTCTTGTCGA AGCGATTACG GATCTTTACT ACTCGTTCTA A
|
Protein sequence | MAKLPTRIEK MSDKESIQTN RSRRNFLKGA TGSVVAGVSA SAFSGAVQAK EPNIDWSRLG KKNEKRFWRK VQKQFVLDKR TTYMNIGTTG SMPKHVLEGY EDNNKIVAKY PWDMKDKFGA WPHVSEMVTD VAPGFGANPD EIILSRNTTD GLCSIINGLH FEPGDVILTT HHEHVAATSP MNVAKHRFGV DVVEIQLPVF TGSEEVSEED YIQAFREAIE AHHNVRLIVF SQITYKTGTT LPAKAICSLA KQHGIPTLVD GAHTVGMFDL DFHDMDCDFY AGSGHKWQCG PGATGILYVR DNGNRLNEYW SDRENPLWLI NSSLSHADHL GKQLQMQYIG NDNYPAKQAL ADSCKMWDEI GRDRIQERVL ELSDLCKTLL NEALPHAQMY SPNVEGLTSG LTTFNPLSDV TDKERLTLFR DRLREEYGYI IRTTNFKLYK DDAYETQALR ISTHLFHDEK DVEGLVEAIT DLYYSF
|
| |