Gene Information |
Locus tag | Arth_3033 |
Symbol | |
ID | 4444400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3399395 |
End bp | 3400426 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639690857 |
Product | hypothetical protein |
Protein accession | YP_832512 |
Protein GI | 116671579 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
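The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene span includes the stop codon; both assumptions are inferred from the numbers in this record rather than stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3399395, 3400426)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Both results match the Gene Length (1032 bp) and Protein Length (343 aa) fields.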
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.233167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC ACCTGACCAT CATGAGTGCC GCGGCCGCCG TCGTCATCGC GATGACGGTG TCGGGCTGCG GCGGCGGCGC CGCAGGGGCA ACCTCGGCCG GCGGAAGTGC CGGCGGCGCC ACCGAGGTGA AGGAGCTCCG CTACCAGGGC TGGGCCAACA CGGTAACGCT GCCGGAACTT GCCCAGGACC TCGGCTACTT CGGCGACGTC AAGCTCAACT GGGTGGGCAA CACCATCAGC GGCCCGCAGG ACATCCAGTC CGCGGCCACC GGGCAGACGG ATTTTGGTGG CGCGTTCGCC GGAGCGGTGG TGAAGCTGGT GGAAGCCGGC GCCCCGGTCA AGGCCGTCAT CAACTACTAC GGCGAAGACG AGAAGACCTT CAACGGCTTC TACGTCAAGG AAGACAGTCC CATCCGCACG GCCCGGGACT TCATCGGCAA GAAGATCGCA GTGAACACCC TCGGAGCACA CGCGGACGCC GTCATCAACA CCTACCTGCA GAAGAACGGT CTGAGCGCCG AGGAAATCAA GCAGGTGCAG CTGGTGGTGG TGCCGCCCAA CGACACCGAG GAGGCCATCC GCCGCGGCCA GGTGGATGCC GGTTCGCTGG GCAGCATCCT GCAGGACAGG GCGATCGCAA ACGGCGGCCT GCGGTCGGTG TTCAGTGACG CGGAACTTTT CGGCACCTTC GCCGGCGGCC CCTACGTGCT GCGCACCGAC TTCATCGCGA AGAACCCAAA CACCACCCGC ACATTCACCA CCGGGGTGGC CAAGGCCATC GAATGGGAGC GGACCACGCC CCGCGAGGAA GTGATCGCCC GCTTTACCAG GATCCTGCAG GAACGCGGCC GCAACGAGAA CCCGGCAGCG CTGCAGTACT GGAAGAGCGT GGGCGTACCC GCCAAGGGCG AGATCAAGGA TGAGGATTTC ACCCGCTGGG GCAAGTGGCT CAAGGACACC GGAATCATCA AGGGCGAACT GGACCCGAAG AAGCTCTACA CCAACGAGTT CAACGCCCTG GTGACCGGAT GA
|
Protein sequence | MKRHLTIMSA AAAVVIAMTV SGCGGGAAGA TSAGGSAGGA TEVKELRYQG WANTVTLPEL AQDLGYFGDV KLNWVGNTIS GPQDIQSAAT GQTDFGGAFA GAVVKLVEAG APVKAVINYY GEDEKTFNGF YVKEDSPIRT ARDFIGKKIA VNTLGAHADA VINTYLQKNG LSAEEIKQVQ LVVVPPNDTE EAIRRGQVDA GSLGSILQDR AIANGGLRSV FSDAELFGTF AGGPYVLRTD FIAKNPNTTR TFTTGVAKAI EWERTTPREE VIARFTRILQ ERGRNENPAA LQYWKSVGVP AKGEIKDEDF TRWGKWLKDT GIIKGELDPK KLYTNEFNAL VTG
|
| |
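The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a field could be cleaned, its GC fraction computed, and the CDS translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares; table 11 differs in the start codons it permits, which this sketch does not handle (an alternative start codon would not be rendered as M).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence field, as displayed in the record.
prefix = clean("ATGAAACGAC ACCTGACCAT CATGAGTGCC")
print(translate(prefix))    # MKRHLTIMSA
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the Protein sequence field. Running the same functions on the full Gene sequence field should give a value comparable to the GC content field and a translation comparable to the Protein sequence field.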