Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2035 |
Symbol | |
ID | 4445444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2294471 |
End bp | 2295718 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639689843 |
Product | VWA containing CoxE family protein |
Protein accession | YP_831515 |
Protein GI | 116670582 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0522892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAT TCGGCGCCGC CATCAGCCCG CCCGGAGTGG AGGCCGCTTC CCTGGCCGCC GGCCTTGTCA CGGCGCTGCG CCGTGCCGGC CTGTCGACTT CCCCGGACCG CGCCGTTCGG CTGGCTGAAG CGCTCCGCCT GATCCCGCCC TTCGCCCGCG AACAGCTTTA CTGGACCTGC CGGGTGGTCC TGGTGGCCTC CCGGGAGCAG GTGCCGGTTT TCGACACCGT CTTTGCCGCC GTGTTTGACG GCCGGCTTGA CCCTGCCGAC AGCCGGGGCG CCACCAAGGC ACCCCCGGCA CTCATCTCCG GGGCGCGCCT CCGCCCGGCC CCGCCGTCAG GACTGGCGAG TGCGCCGGCG GCCGTCCCCA AGGCTTCGCC GCCCGCTCTG CTTCCCGGCA GGGAACACTC CGGCGAGGGG AACGGTCCTG AGCGCGAGGC CATCCTGGCC ATGGCGTCGA CGGAGGAAAG ACTGCACGAA ACGTCCTTCG CTGAGCTCTC CGCCGAGGAG GTGGTCGAGG TACGCCGGCT CGTTCGGGCG ATAGTCTTCG CCACGCCGGT ACGCCTGAGC CGCAGGACCC GCAAGTCTTC GCATAACAAT GCACGGCTGG ACATCCGGAG CACCGTCCGT GCAGCCCAGC GCACCGGCTC CGATGCCACA AGGCTCATCT ACGCCCATCG CCGGCCCCGT CCGCGGCAGC TGGTCCTGCT GTGCGATGTA TCGGCGTCCA TGGAGCCCTA CACGAGGGTG TTCCTGTCAT TACTGCAGGG CGCCGTGGCC GGAGCCCGGG CGGAGGCGTT CGTGTTCTCT ACCCGCCTGA CGCGGCTGAC GCGGCAGCTG GCAGTCCGGA ACCCTGATCA GGCGCTGGCC CGGGCCGCCG CGAGGGCGCC GGACTGGGCT GGAGGGACAC AGATCGCGGA AAGCCTGCGG AGCTTCATTG ATGCGCACGG CCGCCGCGGT TTGGCCAGGG GTGCGGTGGT GGTTGTGCTC TCCGACGGCT GGGCTCAGGA TGATCCGGAC CTCGTGGCCA CGCAAATGGA ACGCCTGAAG CGGCTCGCCT ACCGGATTGT CTGGGTTAAT CCGAGGAAGG CGGACGTGAA CTACCGGCCG CTGGCGGGAG GTATGGCGGC GGCGCTCCCC TACTGCGACG CCTTTGTCAG CGGCCACAAC TATGCGGCCC TCGCGGAAGT GGCAGCCGCG GTCCGGGACG GGCGCAGGAC AACACAAGAG AGGCAGGGCA ACAGCTAA
|
Protein sequence | MAEFGAAISP PGVEAASLAA GLVTALRRAG LSTSPDRAVR LAEALRLIPP FAREQLYWTC RVVLVASREQ VPVFDTVFAA VFDGRLDPAD SRGATKAPPA LISGARLRPA PPSGLASAPA AVPKASPPAL LPGREHSGEG NGPEREAILA MASTEERLHE TSFAELSAEE VVEVRRLVRA IVFATPVRLS RRTRKSSHNN ARLDIRSTVR AAQRTGSDAT RLIYAHRRPR PRQLVLLCDV SASMEPYTRV FLSLLQGAVA GARAEAFVFS TRLTRLTRQL AVRNPDQALA RAAARAPDWA GGTQIAESLR SFIDAHGRRG LARGAVVVVL SDGWAQDDPD LVATQMERLK RLAYRIVWVN PRKADVNYRP LAGGMAAALP YCDAFVSGHN YAALAEVAAA VRDGRRTTQE RQGNS
|
| |