Gene Information |
Locus tag | Arth_1873 |
Symbol | |
ID | 4445597 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2107241 |
End bp | 2109217 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689686 |
Product | hypothetical protein |
Protein accession | YP_831358 |
Protein GI | 116670425 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
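The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, when has_stop is True,
    that the final codon is a stop codon that is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(2107241, 2109217)
protein_aa = expected_protein_length(length_bp)
```

With the record's values, `length_bp` is 1977 and `protein_aa` is 658, matching the Gene Length and Protein Length fields.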
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.773936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGA TGAGCAGGCC GCCAGCGGAT CCCCAGGCGG CTCGGCCGGG CGCCGGGGAC AACTTCCGGA ACTGGCTGCT GTTCGGCCTG GTGGACGCCA AGGGGATCCA CCAGGGGCCC GGAGCGGTGA GCGATTCGCA CCTGAAGAAG CATCCGTGGT GGCAGGTGAT GTGCCTGACG GGTGTCGATT ACTTCTCCAC CTTGGGTTAC CAGCCTGCAA TCGCGGCGCT GGCGGCGGGC GTGATCTCTC CGCTCGCGAC CGTGGTGCTG GTCGCTGTCA CCTTGCTCGG CGCCCTGCCC GTGTACCGCC GGGTAGCCGG GGAGAGCCAC CGGGGTGAGG GGTCCATCGC CATGCTGGAG AGGTTGATGC CGCGCTGGGG CGGGAAGCTC TTGGTGCTGG TGCTTCTGGG GTTCGCGGCG ACGGACTTCA TGATCACCAT GACGCTCTCC GCAGCCGATG CCACGGCGCA CGCGCTCCAG AATCCCTTCA CGCCGGCCTG GATGCAGGGA CAGAACGTGC TGCTCACGCT GTTCCTGCTG GCCCTGCTGG GGGCTGTGTT CCTGCGGGGC TTCAAGGAGG CCATCGGTGT GGCGGTAGTT CTGGTCGGTG TTTACCTGGG CCTGAATGTG GTGGTGGTGG CCGCGACCGT ATTCGAGGCT GTTACCCATC CGGTTGCTGT GGGGGACTGG TGGCACGCGC TGACCACCTC GCACGGGAAT CCGTTCATGG TGGTCGGGAT CGCGCTGCTG GTCTTCCCGA AGCTGGCGCT GGGCCTGTCC GGTTTCGAGA CCGGGGTCGC CGTGATGCCA CAGATTAGGG GCCGGCCAGG GGATACCGAG GACAAACCGG TCGGCCGGAT CGAGGGGGCC CGCCGGCTCC TGACAACCGC GGCAGTCATC ATGAGTTCCT TCCTGATCAC CACCAGCTTC ACCACAGTGA TCCTGATCCC GGAGCAGGAA TTCCAGCCCG GCGGCCAGGC CAACGGCCGC GCCCTGGCGT TTTTGGCCCA CGAGTACCTC GGCGCGGGTT TCGGGACCGT GTATGACATG AGCACCATCG CCATTCTCTG GTTCGCCGGC GCCTCCGCGA TGGCGGGCCT GCTGAACCTG GTCCCGCGGT ACCTGCCCCG CTACGGTATG GCCCCGGGGT GGGCGCGGGC GGTGCGGCCG CTGGTGCTCG TGTTCACTGC GGTCGGATTC CTGATCACCT GGCTCTTCGA CGCCGACGTC GACGCCCAGG GCGGCGCCTA CGCCACCGGC GTCCTGGTGC TGATGACCTC GGCTGCGGTG GCGGTCACAC TGTCGGCCCG TCGGCGGCAG CAAAGCAAGC GCACCCTGGG GTTCGGTGTC ATTGCCGTGG TGTTCATTTA CACGACGGTG GCCAACATTT TTGAACGGCC CGAGGGCATC CGGATCGCCT CGTTCTTCAT CGCAGGCATC ATCGTGATCT CGTTGCTCTC CCGGATCCGG CGGTCCTTTG AACTCCACGC CACCCACGTC CACCTGGACC GGCAGGCGCT GGAATTCATG TCCTCCAACG TGTCCGGCCC GATCGCGCTC ATCGCTCACG AACCCCTCCG GCTGAGCCCG GAGGCATACC GGGACAAGTT GACCTCCGCA ATCGAGGTCA GCCACCTTCC GCTTGAGCAC CAGGCGCTGT TCCTGGAAGT GATCGTGGAC GATTCCTCCG ACTTTGAGAC AGAACTCGAG GTCCGCGGCG TGACCCGGCA CGGATACCAG ATCCTGGAGG TCCACGGACC GGTCGTGCCG AACACGATCG CCTCGGTCCT GCTGCACATC 
CGCGACGTGA CGGGCCTGAT GCCGCACATT TACTTCCGCT GGACGGAGGG CAACCCGATC ATCAACCTGC TCAAGTTCCT CTTCCTGGGC GAAGGTGAAA TCGCTCCGGT GACCCGAGAA GTACTCCGCG AAGCCGAACC GGACGTCACC AAACGACCAT GGGTCCACGT CGGCTAA
|
Protein sequence | MTTMSRPPAD PQAARPGAGD NFRNWLLFGL VDAKGIHQGP GAVSDSHLKK HPWWQVMCLT GVDYFSTLGY QPAIAALAAG VISPLATVVL VAVTLLGALP VYRRVAGESH RGEGSIAMLE RLMPRWGGKL LVLVLLGFAA TDFMITMTLS AADATAHALQ NPFTPAWMQG QNVLLTLFLL ALLGAVFLRG FKEAIGVAVV LVGVYLGLNV VVVAATVFEA VTHPVAVGDW WHALTTSHGN PFMVVGIALL VFPKLALGLS GFETGVAVMP QIRGRPGDTE DKPVGRIEGA RRLLTTAAVI MSSFLITTSF TTVILIPEQE FQPGGQANGR ALAFLAHEYL GAGFGTVYDM STIAILWFAG ASAMAGLLNL VPRYLPRYGM APGWARAVRP LVLVFTAVGF LITWLFDADV DAQGGAYATG VLVLMTSAAV AVTLSARRRQ QSKRTLGFGV IAVVFIYTTV ANIFERPEGI RIASFFIAGI IVISLLSRIR RSFELHATHV HLDRQALEFM SSNVSGPIAL IAHEPLRLSP EAYRDKLTSA IEVSHLPLEH QALFLEVIVD DSSDFETELE VRGVTRHGYQ ILEVHGPVVP NTIASVLLHI RDVTGLMPHI YFRWTEGNPI INLLKFLFLG EGEIAPVTRE VLREAEPDVT KRPWVHVG
|
| |
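The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the GC content and the translation could be recomputed from such text. It assumes the standard codon assignments (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)
```

For example, `translate("ATGACCACGA TGAGCAGGCC GCCAGCGGAT")` on the first three blocks of the gene sequence returns `MTTMSRPPAD`, the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.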