Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_11930 |
Symbol | comL |
ID | 7760135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1144779 |
End bp | 1145792 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643804094 |
Product | competence protein ComL |
Protein accession | YP_002798396 |
Protein GI | 226943323 |
COG category | [R] General function prediction only |
COG ID | [COG4105] DNA uptake lipoprotein |
TIGRFAM ID | [TIGR03302] outer membrane assembly lipoprotein YfiO |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCTGA AACACCTGCT GCTGATCGCC TCGCTCGTTC TCATCGCCGC CTGCGGATCG AAAAAAGAGA AGGAGGAAGT CGTCGACGAG AATCTGAGCG AGACCGAGCT GTACCAGCAG GCCCAGAACG ACCTCAACAA CGAGAACTTC GGCTCCGCGA CCACCAAGCT GAAGGCCCTG GAATCGCGCT ATCCGTTCGG CCGCTACGCC GAACAGGCAC AGCTCGAACT GATCTACGCC TACTACAAGA GCCAGGAAAC GGACGCCTCG CGCTCGGCCG CGGAGCGTTT CATCCGCCTG CACCCGCAAC ACCCGAACGT CGACTACGCC TACTACCTCA AGGGGCTGGC TTCCTTCGAC CAGGACCGGG GGCTGCTGTC GCGCTTCCTG CCGCTGGACA TGACCAAGCG CGATCCGGGC GCGGCCCGCG ACTCCTTCAA CGAGTTCGCC CAACTCACCA GCCGCTTCCC GAACAGCCGC TATGCGCCGG ACGCCAAGGC GCGCATGGTC TATCTGCGCA ACCTGCTGGC CGCCTACGAG ATCCATGTCG CCCATTACTA CCTGAAGCGC GAAGCCTACG TCGCTGCCGC CAACCGGGGT CGCTATGTTG TGGAGAACCT CCAGGAAACG CCCGCGGTGG GCGACGGCCT GGCGGTGATG ATCGAAGCCT ACCAGCGCAT GACCCTGGAC GAACTGGCCA CCACCAGCCT GGAAACCCTG AAGCTGAACT ATCCCGATCA CCCCAGCCTG CAGAACGGCC AGTTCGTACC GCTGGAGGAA GAGGACGACA ACCGCTCCTG GCTGGGCAAG GCGACTCTCG GCCTGATCGA GAGCAAACCG CCTCTGCCGC CGGGCGAAAC CCGCGCCAAT CAGGACATCC GCCGCATGTA CGAGGAGGCC CGTCAGGAAA TCCGCGCCGA CCTGAAGCAA GGCGAGGAGA CCCGCGAAGC GGTCGAAAGC GCCGATCGCA AGCCCAAACG CGCCTGGTGG AACCCCCTGC GCCTGCTCGA CTGA
|
Protein sequence | MHLKHLLLIA SLVLIAACGS KKEKEEVVDE NLSETELYQQ AQNDLNNENF GSATTKLKAL ESRYPFGRYA EQAQLELIYA YYKSQETDAS RSAAERFIRL HPQHPNVDYA YYLKGLASFD QDRGLLSRFL PLDMTKRDPG AARDSFNEFA QLTSRFPNSR YAPDAKARMV YLRNLLAAYE IHVAHYYLKR EAYVAAANRG RYVVENLQET PAVGDGLAVM IEAYQRMTLD ELATTSLETL KLNYPDHPSL QNGQFVPLEE EDDNRSWLGK ATLGLIESKP PLPPGETRAN QDIRRMYEEA RQEIRADLKQ GEETREAVES ADRKPKRAWW NPLRLLD
|
| |