Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Oant_2861 |
Symbol | |
ID | 5382010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ochrobactrum anthropi ATCC 49188 |
Kingdom | Bacteria |
Replicon accession | NC_009668 |
Strand | - |
Start bp | 140080 |
End bp | 141180 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640835536 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001371398 |
Protein GI | 153010184 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.816183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGTC GGGAAGTTCT TAAATCCGGT GCGGCTTGCG CTGCTTTCAG CATGCTGCCG AATTTTGCAT CTGCAGCCGG TAATACATTC GCACCGCAAC CGTCGGGCTG GCGCAGCTTC GAGATTACCA CCAGCATCAC GCCTGCCACT TCGGGGGAAG TCATTCGTGT CTGGGTTCCG ATGGCCGGGT TCAATGGCAA TGACTGGAAT CGTCCGGAAG GCAATCGCTG GACGACAAAC GCCAAGGTCG CGCGGCTCGA TCGCGATCCG GCCAGCGGCA CCGGCTTGCT TTACCTCGAA TGGGACGCAA ACGAAGCGAA ACCGCAAGCC GAAGTCATCA GCCTCATCTC CACCCGCGAC CGTGCAAGCA ATCTCGATGC ATCGGCAAAG GCCGTGCCGC TGACCCGTGC CGAACGCGAA CGCTATCTGG CCGGGACACG CCTTGCGCCT GTTGATGGCA TCGTCAAGCA AACCTCGGAC AAGATCGTTG CGGGTGCGGA CACCGATCAG GAAAAGGCCC GCCTCATCTA TGACTGGGTT GTAGCCAACA CCTACCGCCG TGCCTCGACG CGTGGCTGCG GTGATGGAAA CATTGTCGCC ATGCTTGATA GCGGCGATCT CGGCGGCAAA TGCGCCGACC TCAATCCGCT GTTCGTGGCT CTGGTCAGGG CAGCAGGAAT TCCGGCCCGC GACCTCTACG GCGTGCGCGT TGCGCCTTCC GCCTTCGGCT ACAAGAGCCT TGGCGCGAAG AACGAAGTCA TCACCAAGGC CCAACACTGC CGCGCCGAAA TCTATCTCGA TGGTATTGGC TGGGTGGCGA CCGATCCGGC GGATGTACGC AAGGTGATGC TGGAAGAGGA AAAGGACGGG CTGACTGCCG ACGATCCGCG TGTCGTGGCC GTGCGGCAAA AGCTTTTCGG CTCATGGGAG GGCAACTGGA TCGCCTTCAA TGATGGCAGC GATATCGCCT TGCCATCTGC GCAAGGACCG GAGCTCGGTT TCCTGATGTA TCCGCAGGCC GAAGTTGCCA GCGCCCGTCT GGATTGCCTC GACGCGGATG CATTCCGTTA CGCGATGACC GCACGCGAAA TAACAATTTA G
|
Protein sequence | MNRREVLKSG AACAAFSMLP NFASAAGNTF APQPSGWRSF EITTSITPAT SGEVIRVWVP MAGFNGNDWN RPEGNRWTTN AKVARLDRDP ASGTGLLYLE WDANEAKPQA EVISLISTRD RASNLDASAK AVPLTRAERE RYLAGTRLAP VDGIVKQTSD KIVAGADTDQ EKARLIYDWV VANTYRRAST RGCGDGNIVA MLDSGDLGGK CADLNPLFVA LVRAAGIPAR DLYGVRVAPS AFGYKSLGAK NEVITKAQHC RAEIYLDGIG WVATDPADVR KVMLEEEKDG LTADDPRVVA VRQKLFGSWE GNWIAFNDGS DIALPSAQGP ELGFLMYPQA EVASARLDCL DADAFRYAMT AREITI
|
| |