Gene Information |
Locus tag | Avin_26570 |
Symbol | |
ID | 7761565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2718194 |
End bp | 2719225 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643805535 |
Product | Pentapeptide repeat protein |
Protein accession | YP_002799808 |
Protein GI | 226944735 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
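The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; both assumptions are consistent with the listed values, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2718194, 2719225)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Both results agree with the Gene Length and Protein Length fields listed in the record.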
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.146804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGGCGCG GCGAGGCCAT CGAGGGGCGC GCGCTGCCGG GCTTGCGCAT GGCTGGTCTG GAACTGGCCG GCGGCCTGTT CCGCGGCTGC GACCTGAGTG GCAGCGACTG GCAGGGGACG ACGCTGCGGG CGTGCCTGTT CATCGACTGC GCGCTCGAGG CGGCCGATTT CCGCATGGCC CGGCTGGACG AGGTGCAGTG GCAGAACTGC GCGCTGGCGG ATGCCGCGTT CGCGGGCGCC GAGCTGGAGC GCTGCCAACT CATCGAGTCC GGCCTGGCGC GGGCCGTGTT CGACGGGGCC CTGCTGGGGG GCCTGGCGTT CATCCAGTCG GTGCTGGCGC ACGCTTCGTT CGAAGAGGCG ACCCTGGCCG AGACTTCGTT CAACGAATGC GACCTGCGGG ATTGCCGGTT CGCTCGTGCC CATCTGAGCG AAACCACCCT GTTCGAGCTG GACCTGTGCT CGGTGGATTT CGCCGAAACC CGCTTCGAGT CGGTCATCTT CAGCGGTTCG AACCTGTCCG GGCAGTGCCT GATGCGCACC TCGCTGGCCG GCTGCCAGTT CTCCGCGGCG CTCCTGGACG GCTGCGACCT GACGGAGGCG GTCCTGAGCC AGGCGGTGTT CAAGGACGCC AGCCTGGTGG GTGCCCGGCT GACGGGGGTG GAGGCACGCT ATGCGCTGTT CCCGGACGCC GACCTGAGCG ACGCGGATTG CCGCCAGGGC CGTTTCGCCC AGAGCGTCTG GGCGGGCGCG CAACTCGCCG GCGCCGATTT TTCCCAGGCC TGCCTGGACA TGGCCGTGTT GCAGCGGACA TGGGCCAGGG GCGCCCGCTT CGAGCGGGCC AGTCTGCGCC ACGCCGAATT GTCCTATGCC GATTTCACCG GCGCCGACTT CGCCGGCGCC CTGTTCGAGC GCACGTCCTT TCACCGTTCG CTGCTGGAGG ACGCCCGCTT CGATTCGCGC GACGGGCTGA TCGAGCGCGA CGAGGAACTC TGGGCGGCGG AGGAGCGCGC GCAGGCCGGG TCGCGCCGAT AG
|
Protein sequence | MRRGEAIEGR ALPGLRMAGL ELAGGLFRGC DLSGSDWQGT TLRACLFIDC ALEAADFRMA RLDEVQWQNC ALADAAFAGA ELERCQLIES GLARAVFDGA LLGGLAFIQS VLAHASFEEA TLAETSFNEC DLRDCRFARA HLSETTLFEL DLCSVDFAET RFESVIFSGS NLSGQCLMRT SLAGCQFSAA LLDGCDLTEA VLSQAVFKDA SLVGARLTGV EARYALFPDA DLSDADCRQG RFAQSVWAGA QLAGADFSQA CLDMAVLQRT WARGARFERA SLRHAELSYA DFTGADFAGA LFERTSFHRS LLEDARFDSR DGLIERDEEL WAAEERAQAG SRR
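The two sequence fields can be compared programmatically. The following is a minimal sketch, not the database's own pipeline: it strips the 10-character grouping, computes GC content, and translates with the amino acid assignments of translation table 11. It assumes that the gene sequence is already given in coding orientation (it starts with GTG and ends with TAG, although the gene lies on the minus strand) and that the initiator codon is read as methionine, as is usual for bacterial alternative start codons such as GTG. All function names are hypothetical.

```python
from itertools import product

# Amino acid assignments for table 11, with codons enumerated in
# T, C, A, G order at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    if protein:
        protein[0] = "M"  # assumption: initiator codon (here GTG) is read as Met
    return "".join(protein)


# First three blocks of the Gene sequence field
print(translate("GTGCGGCGCG GCGAGGCCAT CGAGGGGCGC"))  # MRRGEAIEGR
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows a comparison against the Protein sequence and GC content fields of the record.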