Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1903 |
Symbol | |
ID | 4445557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2141307 |
End bp | 2142551 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639689713 |
Product | hypothetical protein |
Protein accession | YP_831385 |
Protein GI | 116670452 |
COG category | [R] General function prediction only |
COG ID | [COG3970] Fumarylacetoacetate (FAA) hydrolase family protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGCA TGAAAATACA GCAAATTCTT CCGGCCGACC CGGCGAAGGC GATCCTCATC GGCCGCATAT GGGATCCGGT CTCCCAGGGC CCCAGAGTGG TCTCTGTCCG GGGTGAGGAC GTTTTCGATC TGAGCCCCGG GGTCCGCACT GTTGCAGAAC TCATGGAATA CGAGAACCCC CTGGAAGTGG TTCTTCAAAG AACTGACGCT CCGCGTTGGG ACCTCGCGGA GATCGTGGAA GCTTCGCAGG GGACGGACCG TAACCGGCCC CACCTGCTGG CGCCCATCGA CCTTCAGGTG GTCAAGGCCT GCGGGGTGAC GTTCGTTGAG AGCATGATCG AGCGGGTCAT CGAAGAGCGG TGCGGCGGTG ATTTCACCCG TGCCGCAGAA GTCCGCAAGG CTGTCGCCGA CGTCCTCGGC GGCAGCTTGG ACTCCGTCCG CCCCGGGTCG GAGCAGGCGC GGGAGGCCAA GAGAATACTG TCCGCCCAGG GCATGTGGTC ACAGTACCTT GAGGTCGGTC TTGGTCCCGA CCCGGAGGTA TTCACCAAAG CCCCGGTCCT ATCCTCGGTG GGATATGGTT CCGGCGTTGG TATTCCCAGC TTTTCCTCAT GGAATAACCC CGAGCCCGAG CTTGTGCTCA TTCTCAACTC CGCCGGGGAT CCCCTGGGTG CCACGCTGGG GAACGACGTG AATCTCCGGG ACGTGGAGGG ACGCAGCGCG CTGTTGCTCG GGATGGCCAA GGACAACAAT GCATCCTGTG CGATTGGTCC GCTGATCCGG CTCTTCCACA AGGACTTCAC ACTGGAGTCC CTCCGGACCG AAGAAATCAC CCTCACGGTC GAAGGTACGG ACGGCTACCG GATGGAGGGT CGCAACAGCG TCGCCCGCAT CAGCCGCTCC TTCGAAGAGC TCATCAAGGC AGCGCACGGC AGCCACCACC AATACCCCGA CGGGTTCGCC CTCTTCACAG GAACGCTGTT CGCCCCCACC CAGGACCGCG ACACGGAAGG CCTCGGCTTC ACGCACAAGA ACGGCGACAT CGTCACCATC AGCAGCCCCC AGCTCGGAAC GCTCATCAAC CAAACCCAGA GCACCGAGGA AACGGAGCCT TGGACGTTCG GAATCACTGC CTTGTTCAGG TACCTCGCCC AGACAGGAAC CGGTGGCGCG CCATCGATCA CCGGGAGCCC GGTATCCGCG GCGCCCGGCA AATCGGCGGG CATGAAAATC GGATTCCGGA ACTAA
|
Protein sequence | MISMKIQQIL PADPAKAILI GRIWDPVSQG PRVVSVRGED VFDLSPGVRT VAELMEYENP LEVVLQRTDA PRWDLAEIVE ASQGTDRNRP HLLAPIDLQV VKACGVTFVE SMIERVIEER CGGDFTRAAE VRKAVADVLG GSLDSVRPGS EQAREAKRIL SAQGMWSQYL EVGLGPDPEV FTKAPVLSSV GYGSGVGIPS FSSWNNPEPE LVLILNSAGD PLGATLGNDV NLRDVEGRSA LLLGMAKDNN ASCAIGPLIR LFHKDFTLES LRTEEITLTV EGTDGYRMEG RNSVARISRS FEELIKAAHG SHHQYPDGFA LFTGTLFAPT QDRDTEGLGF THKNGDIVTI SSPQLGTLIN QTQSTEETEP WTFGITALFR YLAQTGTGGA PSITGSPVSA APGKSAGMKI GFRN
|
| |