Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2599 |
Symbol | |
ID | 8603936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3027378 |
End bp | 3028976 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | sulphate transporter |
Protein accession | YP_003300192 |
Protein GI | 269126822 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0015803 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACT GCCATCCACC GACCCATGAC CGTCCGGCGC TCAAAGCGGC GGACGTCCGA TCCGGTTTCC TGGTCTTCCT CATAGCGCTC CCCCTGTGCC TGGGAATCGC GCACGCCAGT GGTTTCCCGC CCGTCGCCGG AGTGGTGACG GCGATCGTCG GCGCCGTCGC CGGCCTGTTC GGCGGCTCCC CGCTGACCGT CAAGGGCCCG GCCGCGGGCC TGATCGTGGT CTCCCTCGGC GCGGTCGCCG AGCTGGGCGG CGGCGATCCG GTCGCCGGGT ACCGGCGCGC GCTGGCCGCC GGCGCCGTCG CCGCGGCCCT GCAGATCCTG CTCGCCCTGC GCGGCGTCGC CCGCGCCGGC GTGGCCGTCT CACCGTCGGT CGTGCACGGC ATGCTCGCGG CGATCGGCGT CATCATCATC GCCAAGCAGG TGCACGTCGC AGTCGGCGTC CGCCCCGAAG GCGAGCGCAC CTTCGACCTG CTCGCCGAGA TCCCGCGCAG CCTGGCGCAC GCCAACCCCG AGATCGTGCT GCTGGGCGCG ACCGCCCTGC TGATCATGGC CGGCATGCCG CTGGTCCGCG CCCGATGGGC CCGCGCGGTG CCGGCGCCCT TGGTCGCGCT CGCCGCGACG GTGCCGCTGG GACTGGCCTT CCACCTGGGC AGCCCGCACG ACTACCACTT CCTCGGGCGG ATTCACCATC TGGGACCCGA GCACCTCGTC CAGATCCCCG GCACCCTGCT CGGCGCCGTC GCCCTCCCCG ACTTCTCGAT GGTCCTCACC GCGGCGTCGC TGAAGTACAC GGTGATGTTC GCGCTGATCG GCACCATCGA GTCCACCCTG ACCGTGCTGG CGGTCGGCTC GATGGACCCC CGCCGCCGGG CCGCGGACCT GGACCGCGAC CTGCTCGCCG TCGGCTGCGG CAACCTGGCC GCGGCGCTGC TGGGCGGCCT GCCGATGATC TCTGAGATCG TCCGCAGCAA GGCCAACGTC GACGCCGGGG CCGTCTCGCG CTGGTCCAAC TTCTTCCACG GGGCCTTCTT GCTGCTGTTC GTGGCGCTGG CGCCCGGTCT GCTGCAGGCC GTTCCGCTGG CGGTCCTGGC CGCCATGCTG GTCTACACCG GCGCCCGGCT GGCCTCGCCG CGCGAGCTGC TGCGCGTCGG CAGGATCGGC CCCGACCAGC TCGCGCTCTT TGCGATCACG ATGCTGGTCA CGCTCGCCAC CGACCTGCTG ATCGGCGTGG CGGCCGGGCT GGCCGCCGAA CTGCTGCTGC ACCTGCGCCG GGGCGTGCCG GTGGGCGCCT TCCTGCGCCC CCGGGCCGAG GCGCTGCGGG ACGGCGGCAC GCTGCATGTG CGCATCCCCA GGGCCGCGGT CTTCCCCGCC CTGCTGCCGC TGCACCGGAC GGTCAGCGGG GCCGACGGCG TCACCAGGGT CGTCATCGAC GTCCGGGACG CGGCCGTGGT CGACCACACC TTCCTGCGCA GGGTCGCCGA CCTGTCCGGG CAGTGGCCCA TCACCGCCCT CACCTTCCGG GGGCTGGACC GGCTGCGTCC GGTCTCCGCT CACCCGCAGG CCACCCGGCG GCGCAGGAGG CGGTCATGA
|
Protein sequence | MSNCHPPTHD RPALKAADVR SGFLVFLIAL PLCLGIAHAS GFPPVAGVVT AIVGAVAGLF GGSPLTVKGP AAGLIVVSLG AVAELGGGDP VAGYRRALAA GAVAAALQIL LALRGVARAG VAVSPSVVHG MLAAIGVIII AKQVHVAVGV RPEGERTFDL LAEIPRSLAH ANPEIVLLGA TALLIMAGMP LVRARWARAV PAPLVALAAT VPLGLAFHLG SPHDYHFLGR IHHLGPEHLV QIPGTLLGAV ALPDFSMVLT AASLKYTVMF ALIGTIESTL TVLAVGSMDP RRRAADLDRD LLAVGCGNLA AALLGGLPMI SEIVRSKANV DAGAVSRWSN FFHGAFLLLF VALAPGLLQA VPLAVLAAML VYTGARLASP RELLRVGRIG PDQLALFAIT MLVTLATDLL IGVAAGLAAE LLLHLRRGVP VGAFLRPRAE ALRDGGTLHV RIPRAAVFPA LLPLHRTVSG ADGVTRVVID VRDAAVVDHT FLRRVADLSG QWPITALTFR GLDRLRPVSA HPQATRRRRR RS
|
| |