Gene Information |
Locus tag | Xaut_2282 |
Symbol | |
ID | 5420737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 2543665 |
End bp | 2546910 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640881535 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001417182 |
Protein GI | 154246224 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
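
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `check_lengths` is hypothetical.

```python
def check_lengths(start_bp, end_bp):
    """Derive gene and protein lengths from replicon coordinates.

    Assumes 1-based, inclusive coordinates and a single terminal
    stop codon that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1        # inclusive span in bp
    codons, remainder = divmod(gene_length, 3)  # whole codons, leftover bases
    protein_length = codons - 1                 # drop the stop codon
    return gene_length, remainder, protein_length


# Values copied from the record for Xaut_2282.
gene_length, remainder, protein_length = check_lengths(2543665, 2546910)
print(gene_length, remainder, protein_length)
```

Under these assumptions the derived values agree with the Gene Length (3246 bp) and Protein Length (1081 aa) fields, and the span divides evenly into codons.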
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.41488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGCGC GCGCCGCTGT CGTTCCTCCG GCACCGCAAA TGGCCGGGGT CCCGTTTTCG AGCCCTGCCG TGCTGGGCCG GCGGGCACGG CTGACGCGCC TCCTTTGCGC CTCCACCGCG CTGGCGACGC TTCTGGCAAA CCCGGCGTTT GCCGCGCCCC AGGGCGGGAC AGTGGTCTCG GGGACGGCGA CCATTTCGAC TTCGGGCACC ACCACCAACA TCAATCAATC CACCAACAAG GCCATCATCA ACTGGACGAA CTTCTCGGTC GCCGCCGCGG AGACCGTGAA CTTCAACCAG CCGTCGGCCG CCTCAGTCAC GCTGAACCGC GTGATCGGCA ACGAAAGCTC GGTCATCGCC GGCGCCATCA ACGCCAACGG ACAGGTGTTC CTGGTCAATT CCAACGGCAT CCTCTTCACC GGCACCAGCC AGGTGAACGT GGGCGGCCTC GTCGCCTCGA CGCTGGACAT CAGCAACGCC GACTTCCTCG CCGGCAACTA CGTGTTCTCC GGCTCCTCCA CCGCGTCGGT GGTGAACCGG GGCAACATCA GCGCGGCGAG CGGCGGCTAC GTCTCGCTCA TGGGCAAGAG TGTGTCGAAT GAAGGCGTCA TCACCGCCAC GCTGGGCACC GTCTCGCTGA ATGCCGCCTC GAAGATGACC CTCAATTTCG AGGGCAATTC CCTTGTTGAC GTGACCCTCG ACGAGGGAGT GCTGAACGCC CTTGTGGAGA ACAAGCAGCT CATCAAGGCC GACGGCGGCA AGGTGATCAT GACCGCCAAG GCGGCGGACG CGGTTCTCTC CGCGCAGGTG AACAACAGCG GGATCATTCA GGCCCGCACC ATGGCCGCGC TGACGGGCGG TTCATCGGCG AAGACCTACA AGAAGGGCTC CATCACCCTG AAGGCGGACG GCGGCACCAC GAAGGTCTCC GGCACGCTGG ATGCCTCCGC CCCCAACGGC GGCGACGGCG GCTCCATCGA GACTTCCGGC AACACGGTGA AGATCAGTGA TGCCGCCACC ATCACCACGC TCGCCAGCAG CGGCAACACC GGCAGCTGGC TCATCGATCC CGACGGCTTC ACCATTGGCC GTTACGCCTA TTGGACGCGG CCATGGGTCC GCCGGGCGCA TGATGATGGC GGCGACATCA GCGCCAGTAA GTTAAGCTCC CTGCTTGCCA CCACCAGTGT TGAGATACAG TCCACCGACG GCAGCGGCAC CGACGGCGAT ATCACTGTCA ATGCTGCGGT CTCATGGTCG GCCAACACCA CGTTGACCCT CACCGCCACC AACGACATCA ACATCAATGC ACCGATCACC GCCACGGGGA CCAGTGCCGG GCTCAATCTG AACTATGGCG GCGATTATTA CATCGCCAAC GGCGCTGCGG TGACCTTGAG CGGTGCCGAC GCCAGCCTCG TCATGAACGG CCAAGCCTAT ACGCTCATCC ACACCATGGC GCAGTTGGCG GCCCTCGACG ATGCCACCGG AACCGCATCG GGCTTTTACG CCATCGCCAA CGATATCGAC GCCAGCGCCG CCGCCTACGA CGGCCCGGTG ATCGCCAAGC TCAGCGGGAC GCTGGCAGGC CTGGGGCACA CCGTCAGCGG GCTCACCATC TCCTCTGGCG GCAGTTCTGT GGGCCTGATC GGCTCCATCG GCACTGGCAG CAAAGTCCGC GACCTCGGCC TGCTGGACGC GGACATCACT GGAAATTCCA CCGTCGGCGC CCTTGCGGGC GAAAACAACG GCATCGTCAG CAACAGCTAC GCAACCGGCG CCGTGACGAG CCTGTCCAGC 
GGCTGGGCCG GCACCGGCGG TCTGATCGGC GAAAACTACG GCACGATCAC CGGCTCCCAC GCGGCCGTGA CGGTCACGAG CTCCGGCAAT TACGTCGGTG GGCTGGTTGG CTACAGCTAC GGCATCATTA CCGGGTCCTA CGCTACCGGT GATGTCTCGG GCTACATGAC CGTTGGCGGG CTGGTCGGCG GCGGTGGCGG AACTGTAAGC GATGCTTATG CCACCGGCGA CGTAACCGGG AACAATTATG TCGGCGGACT GGCTGGAACT GCCGGCGGCA CCTACACCAA CGTCTATGCG ACCGGAAATG TGACCGGCGT CCAGACCGTA GGCGGCCTCA TCGGCTACAG CAACGGCGTC AAGTTGAACA CCGCCTATGC CAGTGGAAAC GTTACAGGCG CGGGCCAGGA CGTCGGCGGA CTTATCGGTA TGAGCAATTA TGGAACGCTC ACCAACGTTT CTGCATCCGG AACCGTGACC AATACTGGAA GCTATACCGG CGGCATCGTT GGAAGGGTCG TGGGAACGAC GATCAGCGAT GCGTCCTTCT CCGGAACAGT GACCGGGGCC TTCGGCACCG GCGGCATCGC GGGCTTCAAT GGGGGAACCA TCACCGATTC CCAGGTGACC GGGAGCGTGA CGGGTACCGC CGCCGTAGGC GGTATCGCCG GCCTCAATCT CGGCGGAACC ATCAGCAACG TTCTGTTTGC CGGCAGCGTG ACAGGCATCC TGACGGTGGG CGGTGTGGCC GGCACCAGCA CCGGCACCAT CGACAACGCC ACCTCGACAG GAACGGTCTC CGGAACGAGC AATGTCGGGG GTATCGCCGG CAACAATCTC GGTGGCATCA CCAATTCGTC TGCCACCGGC GCAGTCTCCG GGACGCAGAA TACCGGAGGG CTGGTCGGCA ACAACATCGG CGCGGGCAAG GTTTCCAACA GCACCTGGAA CACCACATCC ACCGGCCAGA CCGGAGGCGT GGGCAACGGC CAAGGGATCG GCTCCTCGGT GTCGGGCGTG ACGTCGCCCA CGATCCCTGC CTATGCAACG GATGCTGTGC AATCGGCCGT GCGCGCAGCC ACGGTCAGCG CCACCACCGG CGTGCAGGAG ACGGCGGACA ACCCGCCATC GACCTCAGAC GTGGCGGCGG GGAGTTCGGC GACGGCAGCG CTTGCCGGGC CTTCGGTGGC CTCCCAGATC GACAGCAGCG CAGCGCCGGC GCCCACCACC TCCTGGCGGC AGCTGCAGGA CCAGGAAGAG CGCCGCAGGA AGCGCGCCGC ACAGCAGGTG CAGCCGGCTT CGGCACCGAA GGGCGGGGTT GGCGGGTCCA TCCGCTCCAT CGACGTGGAC GGCCAACATT TCGACCTGGA GAAGGACTCA GCGCCTGCGT CTTCCGCGCC GGCACAGCCG GCCGCTCCCG CTCCGGCGGC CGCACCGGCG CAGTAA
|
Protein sequence | MRARAAVVPP APQMAGVPFS SPAVLGRRAR LTRLLCASTA LATLLANPAF AAPQGGTVVS GTATISTSGT TTNINQSTNK AIINWTNFSV AAAETVNFNQ PSAASVTLNR VIGNESSVIA GAINANGQVF LVNSNGILFT GTSQVNVGGL VASTLDISNA DFLAGNYVFS GSSTASVVNR GNISAASGGY VSLMGKSVSN EGVITATLGT VSLNAASKMT LNFEGNSLVD VTLDEGVLNA LVENKQLIKA DGGKVIMTAK AADAVLSAQV NNSGIIQART MAALTGGSSA KTYKKGSITL KADGGTTKVS GTLDASAPNG GDGGSIETSG NTVKISDAAT ITTLASSGNT GSWLIDPDGF TIGRYAYWTR PWVRRAHDDG GDISASKLSS LLATTSVEIQ STDGSGTDGD ITVNAAVSWS ANTTLTLTAT NDININAPIT ATGTSAGLNL NYGGDYYIAN GAAVTLSGAD ASLVMNGQAY TLIHTMAQLA ALDDATGTAS GFYAIANDID ASAAAYDGPV IAKLSGTLAG LGHTVSGLTI SSGGSSVGLI GSIGTGSKVR DLGLLDADIT GNSTVGALAG ENNGIVSNSY ATGAVTSLSS GWAGTGGLIG ENYGTITGSH AAVTVTSSGN YVGGLVGYSY GIITGSYATG DVSGYMTVGG LVGGGGGTVS DAYATGDVTG NNYVGGLAGT AGGTYTNVYA TGNVTGVQTV GGLIGYSNGV KLNTAYASGN VTGAGQDVGG LIGMSNYGTL TNVSASGTVT NTGSYTGGIV GRVVGTTISD ASFSGTVTGA FGTGGIAGFN GGTITDSQVT GSVTGTAAVG GIAGLNLGGT ISNVLFAGSV TGILTVGGVA GTSTGTIDNA TSTGTVSGTS NVGGIAGNNL GGITNSSATG AVSGTQNTGG LVGNNIGAGK VSNSTWNTTS TGQTGGVGNG QGIGSSVSGV TSPTIPAYAT DAVQSAVRAA TVSATTGVQE TADNPPSTSD VAAGSSATAA LAGPSVASQI DSSAAPAPTT SWRQLQDQEE RRRKRAAQQV QPASAPKGGV GGSIRSIDVD GQHFDLEKDS APASSAPAQP AAPAPAAAPA Q