Gene Mext_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1400 
Symbol 
ID5832590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1574010 
End bp1576208 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content73% 
IMG OID641367200 
Productvault protein inter-alpha-trypsin subunit 
Protein accessionYP_001638872 
Protein GI163850829 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0242423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTCGC TTCCCCACTC CCCGCTTCCT CTGCCTCCGC ATTCCCGATC ACCGCTCGCG 
CACCCGGCTC CGGGCCTGCC GCGCAACCTT CTGCTCGGGC CGCTGATCGC CCTCCTGCTC
GCCCTCATTG CCGGCATCCC GCCGGCAAGA GCGCAGGATT CGAGGGCGCA GGACGGCAGC
GGCACGCTGC TGCTGCGCTT CGAGGGCGGC GCGCCCGTCG AGGCGCCCCG CCTCAAGGCG
GACGCCGCGA TCAGCGTGAG CGGCCCGACC GCACGGGCGA CGATCACCCA GGCCTTCCGC
AACACGACGA GCGAGTGGGT CGAGGGCACC TACCTGTTCC CGCTGCCGGA GGATGCGGCG
GTCGATACGA TGAAGCTCGT GGTCGGCGAC CGGGTCATCG TCGCCGACAT CCGCGAGCGC
GCGGCGGCGC GGCGCGTCTA CGAGGCGGCA AAGGCCGAGG GCAAGGCCGC CGCGCTCACC
GAGCAGCAGC GGCCCAACCT GTTCACCAAC GCGGTCGCCA ATATCGGCCC TGGCGAGACC
GTGCTGGTGC AGATCGAATA CCAGCAGCCG GTGCGGTCCT CCGCCGGCAC CTATGCCTTG
CGCCTGCCGA CCGTCGCCGC CCCGCGCTAC AGCCCCGCTC CCCCGGCGGT AATGGCCGTC
GTCGAGCGGG GCGCCGCCGA TCCCGTCCCG GATCGCGAAA CGATCGCGGC CCCGGTGCTC
GACCCGGCCC GCCATGCGCC GATCAACCCG CTGACGCTCA CCATCGATCT CAAGGCCGGC
TTCACCCTGG GCCAGGTGCG CAGCGCCACC CATGCGGTCC GTATCGAGGA ACGGTCCGCG
AGCGAACGCC GCATCACGCT CGCCGACGGC GCCACCGCCG CCGACCGCGA CTTCGAGCTG
ACCTGGAATG CGGCCCCCGG CGAGGCACCC TCGATCGGCC TGTTCCGCGA GCGGGTGGCG
GGGGCTGAGG CAGTGCTCGC CGTGGTGACC CCGCCGGAAT CTGCGAGCTC GGCCGCATCC
GTGCCCCGTG ACGTGGTGTT CGTCATCGAC AATTCCGGCT CCATGGGCGG CGCCTCGATG
CGGCAGGCCA AGGCGAGCCT GCTCATCGGC CTCGACCGGC TGGGCGCGCA TGACCGCTTC
AACGTGATCC GCTTCGATCA CAGCTTCGAC ACGCTGTTTC CCGATCTGGT GCCCGCCGAC
GCGGGCCATC TGATGCGGGC CAAAAGCTTC GTGGCCGGGC TCCAAGCGAG CGGTGGCACG
GAGATGCTGG CGCCGCTCCA GGCCGCCCTG CGGGGCGCGA CGCCGGAGGA GACGGGGCGT
CTGCGTCAGG TCGTGTTCCT CACCGACGGT GCCATCGGCA ACGAGGCGCA GATCTTCTCC
GCCATCGCCA CGGAGCGCGG GCGCTCGCGG CTGTTCATGG TCGGCATCGG CTCGGCCCCG
AACGGCTACC TGATGCGCCA CGCCGCCGAA CTCGGTCGCG GCAGCTTCAC CCAGATCGAC
ACGCCCGATC AGGTGACCGA GCGGATGCGC GCCCTGTTGG TGAAGCTGGA AAGCCCCGCC
GTCACCGACC TGACCGCGAC CTTCTCCGAG CCCGGCATCG ATGTGACACC CGCCCGCCTA
CCCGACCTCT ACCGCGGCGA GCCGCTCACC CTCTCGGCCC GGATGGGGCA GGCCCGCGGC
ACCCTGACCC TCACCGGCCG GATCGGCGGA CAACCCTGGC AGACGCAGCT GCATCTCGAT
GCGGCACAGG ACGGGACCGG GATCGGCAAG CTCTGGGCGC GGGCGAAGAT CGCCGAGGCC
GAGACCGCGC GGCTGACCGG CGGGCTCACG GCGGAGGCGG CCGACGCGGC GATCCTGCGG
CTGGCGCTCG ACCACGGGCT GACGACCCGC CTGACCTCGC TGGTGGCGGT GGACGCCACC
CCGCGCCGGC CAGCCGGGAT GCGCCTCGCC AGCACCGAGT TGCCGCTCAA CCTGCCGGCC
GGCTGGGACT TCGAGACGGT GTTCGGCACG CAGGATGAGG CGCCGCAGCT TCCCCCGCCG
CCGCGCCAGC GCCGCGCCGC GGCCCCGGCT ACGCAGATCG CGGCCGCGCA AGCCGTGGCG
CTGCCGCAGA CCGCGACGGA TTTCGAGATC CGCGCATGGC TCGGAGCGCT GCTGCTGGCG
CTCGGTCTCG TCCTCTCCCG CCGCCGGCTC GCAGCGTGA
 
Protein sequence
MLSLPHSPLP LPPHSRSPLA HPAPGLPRNL LLGPLIALLL ALIAGIPPAR AQDSRAQDGS 
GTLLLRFEGG APVEAPRLKA DAAISVSGPT ARATITQAFR NTTSEWVEGT YLFPLPEDAA
VDTMKLVVGD RVIVADIRER AAARRVYEAA KAEGKAAALT EQQRPNLFTN AVANIGPGET
VLVQIEYQQP VRSSAGTYAL RLPTVAAPRY SPAPPAVMAV VERGAADPVP DRETIAAPVL
DPARHAPINP LTLTIDLKAG FTLGQVRSAT HAVRIEERSA SERRITLADG ATAADRDFEL
TWNAAPGEAP SIGLFRERVA GAEAVLAVVT PPESASSAAS VPRDVVFVID NSGSMGGASM
RQAKASLLIG LDRLGAHDRF NVIRFDHSFD TLFPDLVPAD AGHLMRAKSF VAGLQASGGT
EMLAPLQAAL RGATPEETGR LRQVVFLTDG AIGNEAQIFS AIATERGRSR LFMVGIGSAP
NGYLMRHAAE LGRGSFTQID TPDQVTERMR ALLVKLESPA VTDLTATFSE PGIDVTPARL
PDLYRGEPLT LSARMGQARG TLTLTGRIGG QPWQTQLHLD AAQDGTGIGK LWARAKIAEA
ETARLTGGLT AEAADAAILR LALDHGLTTR LTSLVAVDAT PRRPAGMRLA STELPLNLPA
GWDFETVFGT QDEAPQLPPP PRQRRAAAPA TQIAAAQAVA LPQTATDFEI RAWLGALLLA
LGLVLSRRRL AA