Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Taci_0207 |
Symbol | |
ID | 8630017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermanaerovibrio acidaminovorans DSM 6589 |
Kingdom | Bacteria |
Replicon accession | NC_013522 |
Strand | - |
Start bp | 227345 |
End bp | 228511 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | cysteine desulfurase NifS |
Protein accession | YP_003316728 |
Protein GI | 269791824 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000531678 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGGC GGGTCTACAT GGATCACTCG GCCACCACCC CGGTGGATCC CAAGGTGGTG GAGGCCATGG TGCCCTTCTT CACCGAGCAC TACGGGAACC CCAACAGCCT GCATGGCTGG GGGAAGGAGG CCCGATCCGC CATGGACCGG GCGAGGGGTC AGGTGGCGTC CCTAATAGGG GCGGAGCCCA AGGAGATCAT ATTCACCGGC GGCGGGAGCG AGGCGGACAA CCTGGCCATA AAGGGCGCCG CCTGGGCCAT GAGGGACCGG GGCAGGCACA TAATAACCAG CGCCATAGAG CACCACGCGG TGCTGGACGC CTTCAAGTGG CTGGGCAAGA ACGGGTTCGA GGTGACGGTG CTGCCCGTGG ACCGGGAGGG GCTGGTCTCC CCCGAGGACC TGAGGGGGGC CATAAGGCCT GACACCACCC TGGTGTCCAT CATGTTCGCC AACAACGAGA TCGGCACCGT CCAGCCCATC GGGGAGCTGG GGGGGATCTG CCGGGAGCGG GGGGTGCTCT TCCACACCGA CGGGGTCCAG GCGGCGGGTC ACATGCCACT GGACGTCCGG TCCTTGCCGG TGGACATGCT GACCATGGCG GCCCACAAGA TGTACGGTCC CAAGGGGATA GGGGCCCTGT ACGTCCGGAG GGGCATAAAG CTCACCCCCC TGATCCACGG GGGAGGCCAG GAGTTCGGCC TCAGGTCCGG CACGGAGAAC GTGCCTTTGG TGGTGGGCTT CGGCATGGCG GCGGAGATGG CGGCTGAGAG GCTCTCAAAG GGGGAGCACG AGCGGGAGGC CCAGCTCCGG GACCGGCTTA TCGAGGGGGT GCTCTCGTCG GTGGAGGACT CCTTCGTCAC CGGAAGCAGG ACCCATCGAC TGCCCTTTCA CGCCAGCTTC TGCATCCCCC GGGTGGAAGG GGAGGCCATG GTGTTGCGGC TGGACTTCGC CGGGGTGGCG GCCTCCAGCG GGTCCGCCTG CACCTCCGGC AGCCTTGACC CCAGCCACGT GCTATTGTCC ATAGGGCTCC CCCACGAGCT GGCCCACGGG TCGCTGCGGC TCACCCTTGG GAAGGACACC ACCCAGGAGG ACGTGGATCA CGTGCTGTCG GTCCTTCCCC CCATAATAAA GACCCTAAGG GACATGTCCC CCTACAAGAA GGCGTAA
|
Protein sequence | MSRRVYMDHS ATTPVDPKVV EAMVPFFTEH YGNPNSLHGW GKEARSAMDR ARGQVASLIG AEPKEIIFTG GGSEADNLAI KGAAWAMRDR GRHIITSAIE HHAVLDAFKW LGKNGFEVTV LPVDREGLVS PEDLRGAIRP DTTLVSIMFA NNEIGTVQPI GELGGICRER GVLFHTDGVQ AAGHMPLDVR SLPVDMLTMA AHKMYGPKGI GALYVRRGIK LTPLIHGGGQ EFGLRSGTEN VPLVVGFGMA AEMAAERLSK GEHEREAQLR DRLIEGVLSS VEDSFVTGSR THRLPFHASF CIPRVEGEAM VLRLDFAGVA ASSGSACTSG SLDPSHVLLS IGLPHELAHG SLRLTLGKDT TQEDVDHVLS VLPPIIKTLR DMSPYKKA
|
| |