Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_49490 |
Symbol | |
ID | 7763803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5011929 |
End bp | 5012906 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643807784 |
Product | Uncharacterized protein family UPF0065 |
Protein accession | YP_002802019 |
Protein GI | 226946946 |
COG category | [S] Function unknown |
COG ID | [COG3181] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00178438 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCCAT CGATCCGTCG CGCCCTGCTG GCGGCCGCCG CGCTGCTGCT GTGCAGCGCC CGGGCCGCCG AACCGCACGC CGCGGAATGC ATAGTCCCGG CGGCCAGCGG CGGCGGCTTC GACCTGGTCT GCAAGCTGGC GCGCGAGGCG CTGCAGGAGG CCGGCCTGAC CCGCCGGCCG CTGCGCCTCG CCTACATGCC GGGCGGGGTC GGCGCGGTGG CCTACAACAC CATCGCCGCG CAGCGGCCGG CCGAGCCCGA CACCCTGGTC ACCTTCTCCA GCGGTTCGCT GCTGAACATC GCCCTGGGCA AGTTCGGCCG CTTCGACGAG AGCGCGGTGC GCTGGCTGGC GGTGATCGGC ACCAGCTACG GCGCCCTGGC GGTACGCGCC GACTCGCCCT ACAAGACCCT GGACGATCTG CTCGCGGCGC TCAGGAAGGA CCCCGACTCG GTGGTGATCG GCGCCTCCGG TACCGTCGGC AGCCAGGACT GGATGCAACT CGCCCTGCTC GCCCGGCTGG CCGGCATCGA TCCGCGCGAG CTGCACCACG TCGCCCTGGA GGGCGGCGGG GAAATCTCCA CGGCGCTGGT CGCCGGCCAC GTGCAGGTGG GCAGTACCGA CATCTCCGAC TCCATGCCGC ACCTGAACGG CGGCGCCATC CGCCTGCTGG TGGTGTTCGC CGAACGGCGC CTGGACGAGC CGGGGATGGC CGCCATCCCG ACCGCCCGCG AGCTGGGCTA CGACGTGGTC TGGCCGGTGC TGCGCGGTCT CTACATGGGG CCCGGGGTGA GCGACGCCGA CTACCGGCGC TGGAAGGACG CCTTCGACCG CCTGCTCGCC TCCGAGGACT TCGCCCGCCT GCGCGACCGC TACGAGTTGT TCCCCTATGC CCTGACCGGC GAAGCCCTGG AGGCGCATGT CAGGCAGCAG GTCGCCCGCT ATCGGGAAAT GGCCCGGGAC TTCGGCCTGA TCCGCTGA
|
Protein sequence | MGPSIRRALL AAAALLLCSA RAAEPHAAEC IVPAASGGGF DLVCKLAREA LQEAGLTRRP LRLAYMPGGV GAVAYNTIAA QRPAEPDTLV TFSSGSLLNI ALGKFGRFDE SAVRWLAVIG TSYGALAVRA DSPYKTLDDL LAALRKDPDS VVIGASGTVG SQDWMQLALL ARLAGIDPRE LHHVALEGGG EISTALVAGH VQVGSTDISD SMPHLNGGAI RLLVVFAERR LDEPGMAAIP TARELGYDVV WPVLRGLYMG PGVSDADYRR WKDAFDRLLA SEDFARLRDR YELFPYALTG EALEAHVRQQ VARYREMARD FGLIR
|
| |