Gene Xaut_2282 details

Gene Information

Locus tag: Xaut_2282
Symbol:
ID: 5420737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Xanthobacter autotrophicus Py2
Kingdom: Bacteria
Replicon accession: NC_009720
Strand:
Start bp: 2543665
End bp: 2546910
Gene Length: 3246 bp
Protein Length: 1081 aa
Translation table: 11
GC content: 65%
IMG OID: 640881535
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_001417182
Protein GI: 154246224
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
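
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2543665
end_bp = 2546910

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 3246 1081
```

Both values agree with the Gene Length and Protein Length fields listed in the record.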


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.41488
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCGC GCGCCGCTGT CGTTCCTCCG GCACCGCAAA TGGCCGGGGT CCCGTTTTCG 
AGCCCTGCCG TGCTGGGCCG GCGGGCACGG CTGACGCGCC TCCTTTGCGC CTCCACCGCG
CTGGCGACGC TTCTGGCAAA CCCGGCGTTT GCCGCGCCCC AGGGCGGGAC AGTGGTCTCG
GGGACGGCGA CCATTTCGAC TTCGGGCACC ACCACCAACA TCAATCAATC CACCAACAAG
GCCATCATCA ACTGGACGAA CTTCTCGGTC GCCGCCGCGG AGACCGTGAA CTTCAACCAG
CCGTCGGCCG CCTCAGTCAC GCTGAACCGC GTGATCGGCA ACGAAAGCTC GGTCATCGCC
GGCGCCATCA ACGCCAACGG ACAGGTGTTC CTGGTCAATT CCAACGGCAT CCTCTTCACC
GGCACCAGCC AGGTGAACGT GGGCGGCCTC GTCGCCTCGA CGCTGGACAT CAGCAACGCC
GACTTCCTCG CCGGCAACTA CGTGTTCTCC GGCTCCTCCA CCGCGTCGGT GGTGAACCGG
GGCAACATCA GCGCGGCGAG CGGCGGCTAC GTCTCGCTCA TGGGCAAGAG TGTGTCGAAT
GAAGGCGTCA TCACCGCCAC GCTGGGCACC GTCTCGCTGA ATGCCGCCTC GAAGATGACC
CTCAATTTCG AGGGCAATTC CCTTGTTGAC GTGACCCTCG ACGAGGGAGT GCTGAACGCC
CTTGTGGAGA ACAAGCAGCT CATCAAGGCC GACGGCGGCA AGGTGATCAT GACCGCCAAG
GCGGCGGACG CGGTTCTCTC CGCGCAGGTG AACAACAGCG GGATCATTCA GGCCCGCACC
ATGGCCGCGC TGACGGGCGG TTCATCGGCG AAGACCTACA AGAAGGGCTC CATCACCCTG
AAGGCGGACG GCGGCACCAC GAAGGTCTCC GGCACGCTGG ATGCCTCCGC CCCCAACGGC
GGCGACGGCG GCTCCATCGA GACTTCCGGC AACACGGTGA AGATCAGTGA TGCCGCCACC
ATCACCACGC TCGCCAGCAG CGGCAACACC GGCAGCTGGC TCATCGATCC CGACGGCTTC
ACCATTGGCC GTTACGCCTA TTGGACGCGG CCATGGGTCC GCCGGGCGCA TGATGATGGC
GGCGACATCA GCGCCAGTAA GTTAAGCTCC CTGCTTGCCA CCACCAGTGT TGAGATACAG
TCCACCGACG GCAGCGGCAC CGACGGCGAT ATCACTGTCA ATGCTGCGGT CTCATGGTCG
GCCAACACCA CGTTGACCCT CACCGCCACC AACGACATCA ACATCAATGC ACCGATCACC
GCCACGGGGA CCAGTGCCGG GCTCAATCTG AACTATGGCG GCGATTATTA CATCGCCAAC
GGCGCTGCGG TGACCTTGAG CGGTGCCGAC GCCAGCCTCG TCATGAACGG CCAAGCCTAT
ACGCTCATCC ACACCATGGC GCAGTTGGCG GCCCTCGACG ATGCCACCGG AACCGCATCG
GGCTTTTACG CCATCGCCAA CGATATCGAC GCCAGCGCCG CCGCCTACGA CGGCCCGGTG
ATCGCCAAGC TCAGCGGGAC GCTGGCAGGC CTGGGGCACA CCGTCAGCGG GCTCACCATC
TCCTCTGGCG GCAGTTCTGT GGGCCTGATC GGCTCCATCG GCACTGGCAG CAAAGTCCGC
GACCTCGGCC TGCTGGACGC GGACATCACT GGAAATTCCA CCGTCGGCGC CCTTGCGGGC
GAAAACAACG GCATCGTCAG CAACAGCTAC GCAACCGGCG CCGTGACGAG CCTGTCCAGC
GGCTGGGCCG GCACCGGCGG TCTGATCGGC GAAAACTACG GCACGATCAC CGGCTCCCAC
GCGGCCGTGA CGGTCACGAG CTCCGGCAAT TACGTCGGTG GGCTGGTTGG CTACAGCTAC
GGCATCATTA CCGGGTCCTA CGCTACCGGT GATGTCTCGG GCTACATGAC CGTTGGCGGG
CTGGTCGGCG GCGGTGGCGG AACTGTAAGC GATGCTTATG CCACCGGCGA CGTAACCGGG
AACAATTATG TCGGCGGACT GGCTGGAACT GCCGGCGGCA CCTACACCAA CGTCTATGCG
ACCGGAAATG TGACCGGCGT CCAGACCGTA GGCGGCCTCA TCGGCTACAG CAACGGCGTC
AAGTTGAACA CCGCCTATGC CAGTGGAAAC GTTACAGGCG CGGGCCAGGA CGTCGGCGGA
CTTATCGGTA TGAGCAATTA TGGAACGCTC ACCAACGTTT CTGCATCCGG AACCGTGACC
AATACTGGAA GCTATACCGG CGGCATCGTT GGAAGGGTCG TGGGAACGAC GATCAGCGAT
GCGTCCTTCT CCGGAACAGT GACCGGGGCC TTCGGCACCG GCGGCATCGC GGGCTTCAAT
GGGGGAACCA TCACCGATTC CCAGGTGACC GGGAGCGTGA CGGGTACCGC CGCCGTAGGC
GGTATCGCCG GCCTCAATCT CGGCGGAACC ATCAGCAACG TTCTGTTTGC CGGCAGCGTG
ACAGGCATCC TGACGGTGGG CGGTGTGGCC GGCACCAGCA CCGGCACCAT CGACAACGCC
ACCTCGACAG GAACGGTCTC CGGAACGAGC AATGTCGGGG GTATCGCCGG CAACAATCTC
GGTGGCATCA CCAATTCGTC TGCCACCGGC GCAGTCTCCG GGACGCAGAA TACCGGAGGG
CTGGTCGGCA ACAACATCGG CGCGGGCAAG GTTTCCAACA GCACCTGGAA CACCACATCC
ACCGGCCAGA CCGGAGGCGT GGGCAACGGC CAAGGGATCG GCTCCTCGGT GTCGGGCGTG
ACGTCGCCCA CGATCCCTGC CTATGCAACG GATGCTGTGC AATCGGCCGT GCGCGCAGCC
ACGGTCAGCG CCACCACCGG CGTGCAGGAG ACGGCGGACA ACCCGCCATC GACCTCAGAC
GTGGCGGCGG GGAGTTCGGC GACGGCAGCG CTTGCCGGGC CTTCGGTGGC CTCCCAGATC
GACAGCAGCG CAGCGCCGGC GCCCACCACC TCCTGGCGGC AGCTGCAGGA CCAGGAAGAG
CGCCGCAGGA AGCGCGCCGC ACAGCAGGTG CAGCCGGCTT CGGCACCGAA GGGCGGGGTT
GGCGGGTCCA TCCGCTCCAT CGACGTGGAC GGCCAACATT TCGACCTGGA GAAGGACTCA
GCGCCTGCGT CTTCCGCGCC GGCACAGCCG GCCGCTCCCG CTCCGGCGGC CGCACCGGCG
CAGTAA
 
Protein sequence
MRARAAVVPP APQMAGVPFS SPAVLGRRAR LTRLLCASTA LATLLANPAF AAPQGGTVVS 
GTATISTSGT TTNINQSTNK AIINWTNFSV AAAETVNFNQ PSAASVTLNR VIGNESSVIA
GAINANGQVF LVNSNGILFT GTSQVNVGGL VASTLDISNA DFLAGNYVFS GSSTASVVNR
GNISAASGGY VSLMGKSVSN EGVITATLGT VSLNAASKMT LNFEGNSLVD VTLDEGVLNA
LVENKQLIKA DGGKVIMTAK AADAVLSAQV NNSGIIQART MAALTGGSSA KTYKKGSITL
KADGGTTKVS GTLDASAPNG GDGGSIETSG NTVKISDAAT ITTLASSGNT GSWLIDPDGF
TIGRYAYWTR PWVRRAHDDG GDISASKLSS LLATTSVEIQ STDGSGTDGD ITVNAAVSWS
ANTTLTLTAT NDININAPIT ATGTSAGLNL NYGGDYYIAN GAAVTLSGAD ASLVMNGQAY
TLIHTMAQLA ALDDATGTAS GFYAIANDID ASAAAYDGPV IAKLSGTLAG LGHTVSGLTI
SSGGSSVGLI GSIGTGSKVR DLGLLDADIT GNSTVGALAG ENNGIVSNSY ATGAVTSLSS
GWAGTGGLIG ENYGTITGSH AAVTVTSSGN YVGGLVGYSY GIITGSYATG DVSGYMTVGG
LVGGGGGTVS DAYATGDVTG NNYVGGLAGT AGGTYTNVYA TGNVTGVQTV GGLIGYSNGV
KLNTAYASGN VTGAGQDVGG LIGMSNYGTL TNVSASGTVT NTGSYTGGIV GRVVGTTISD
ASFSGTVTGA FGTGGIAGFN GGTITDSQVT GSVTGTAAVG GIAGLNLGGT ISNVLFAGSV
TGILTVGGVA GTSTGTIDNA TSTGTVSGTS NVGGIAGNNL GGITNSSATG AVSGTQNTGG
LVGNNIGAGK VSNSTWNTTS TGQTGGVGNG QGIGSSVSGV TSPTIPAYAT DAVQSAVRAA
TVSATTGVQE TADNPPSTSD VAAGSSATAA LAGPSVASQI DSSAAPAPTT SWRQLQDQEE
RRRKRAAQQV QPASAPKGGV GGSIRSIDVD GQHFDLEKDS APASSAPAQP AAPAPAAAPA
Q
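
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used to produce the record: the `translate` function and `CODON_TABLE` name are hypothetical, and the amino-acid assignments are those of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are the same for
# the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as shown in the record.
first_line = "ATGCGTGCGC GCGCCGCTGT CGTTCCTCCG GCACCGCAAA TGGCCGGGGT CCCGTTTTCG"
print(translate(first_line))  # prints: MRARAAVVPPAPQMAGVPFS
```

The output matches the first 20 residues of the protein sequence above, and translating the final line of the gene sequence (`CAGTAA`) gives `Q`, the last residue, followed by the stop codon.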