Gene Information |
Locus tag | Avin_31510 |
Symbol | |
ID | 7762050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3258184 |
End bp | 3259005 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806025 |
Product | NLPA lipoprotein |
Protein accession | YP_002800289 |
Protein GI | 226945216 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1464] ABC-type metal ion transport system, periplasmic component/surface antigen |
TIGRFAM ID | [TIGR00363] lipoprotein, YaeC family; [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0866594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTCC CGTCCCTCTC CCGCCGCCAC TGGCTCAAGA CCCTCGCCTG CGGCGCCCTG CTCGGCTTCT CCACCCTGGG CATGGCCAGC GATGCCCCTC TGAAGATCGG CACCACCGCG GCCTTCGCCC CGCCTCTCGA AGTGGCGGTC GCCGAAGCCG CCAAGGAAGG CATCGAGGTC GATCTGGTCG AGTTCAGCGA CTGGATCTCG CCGAACACCA CCCTGGCCCA CGGCGACATC GACGCCAACT ACTTCCAGCA CATTCCGTTC CTGGAGAACG CCAGGAAGGA AGGCGGCTAC GACCTGGTGC CGGTGGCCCC CGGCGTGCTG AACAACGTCG GCCTCTATTC GAAGAAGTAC AAGAGCTTCG CCGAGCTGCC CGAGGGCGCC AAGGTGGCCA TCGCCAACGA TCCGGTGAAC GGCGGACGCG GCCTGCTGCT GCTGGAAAAG GCCGGGCTGA TCAGCCTCAA GCCGGGCATC GGCTACAAGG CCACCCTGGA CGACATCACC GCCAATCCGA AGAAGCTCGA CATCATCGAA CTGGAGGCGG TGCAACTGGT GCGCGCCCTG GACGACGTCG ACCTGGCCCA GGGCTATCCC CTCTACATCC GCCTGTCCAA CGCCGTCGAT CCGCACTCGG CGCTGCTGTT CGACGGCCTG GACCACCCGG AATACGTGAT CCAGTTCGTC GCCCGCCCGC AGGGCAAGGA CGATCCGCGC CTGCGCCGCT TCATCGACAT CTACCAGCAC TCGAGCGCGG TGCGCGCCGC GCTCGACCAG AGCCTGGGCG GCCTCTACGT GCCCGGCTGG GAGAAGAAAT GA
|
Protein sequence | MSFPSLSRRH WLKTLACGAL LGFSTLGMAS DAPLKIGTTA AFAPPLEVAV AEAAKEGIEV DLVEFSDWIS PNTTLAHGDI DANYFQHIPF LENARKEGGY DLVPVAPGVL NNVGLYSKKY KSFAELPEGA KVAIANDPVN GGRGLLLLEK AGLISLKPGI GYKATLDDIT ANPKKLDIIE LEAVQLVRAL DDVDLAQGYP LYIRLSNAVD PHSALLFDGL DHPEYVIQFV ARPQGKDDPR LRRFIDIYQH SSAVRAALDQ SLGGLYVPGW EKK
|
| |
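The coordinates, lengths, and sequences in this record can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the database record: it assumes the coordinates are 1-based and inclusive, that the final codon is a stop codon, and that translation table 11 shares its codon-to-amino-acid assignments with the standard code. All names (`CODON_TABLE`, `translate`, and so on) are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
START_BP = 3258184
END_BP = 3259005

# First 30 and last 15 bases of the gene sequence, spaces removed.
GENE_HEAD = "ATGTCGTTCCCGTCCCTCTCCCGCCGCCAC"
GENE_TAIL = "TGGGAGAAGAAATGA"

# Codon table in the conventional TCAG ordering. Assumption: table 11
# differs from the standard code only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the coding sequence
            break
        protein.append(residue)
    return "".join(protein)


# Inclusive coordinates: length = end - start + 1.
gene_length = END_BP - START_BP + 1

# One codon per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(translate(GENE_HEAD), translate(GENE_TAIL))
```

The computed lengths agree with the Gene Length and Protein Length fields, and the translated fragments match the start (MSFPSLSRRH) and end (WEKK) of the listed protein sequence.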