Gene Mnod_4285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_4285 
Symbol 
ID7301184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp4325554 
End bp4327500 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content75% 
IMG OID643601938 
ProductCollagen triple helix repeat protein 
Protein accessionYP_002499464 
Protein GI220924162 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.275713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAGG CTGACCGGCT CGCGGAGGCG GTGTTTCGCG CGGTGGAGGG CTATCTCGCC 
CGGACGGTGG GGCCGCTGCT CGCCCGGCTC GAGGCGCTGG AGAGGCGGGA GCCTCTGCGC
GGCGAGCGGG GTCACAATGG CCAGCCGGGC CGCGGCCTCG CGAAGGCCAT CGTCACCGCG
GATGGCCTGC TCGTCCTGAC GATGACGACG GGCGAGGAGC TCAGCGTCGG CCGCGTGACA
GGCAAGGACG GCAAGGACGG GGCCGATGGC GCGCCCGGCC GGGACGGCCG CGATGGGGTC
GACGGCGCGC CGGGCGAGCC GGGCCGCGAT GGCACGGATG GGGCGGATGG TCGGGACGGC
CGCGACTTCG ATCCGGAGCT TCTCCGCACC GCGGTGGTCG AGGAGGTATC CAAGGCGGTC
GACGCTATCC CGAGGCCGCG GGACGGTGCG CCTGGGCGCG ACGGCACGGA CGGGAAGGAC
GGACGGGATG GCAAGGACTT CGATCCCGAG GTGCTCGCGG CCGCTGTCGA GCAGGCCGTC
ACGAAGGCCC TCAGCCAGAT CCCTGTTCCG AAGGACGGGA CGCCGGGCCG GGACGGCAGG
GATGGCGTCG GCGTTGCGGG CGCCCTGATC GACCGGAGCG GCAAGCTCAT CCTCACTCTG
TCGAATGGCG AAACGCGGGA TCTCGGCTTG GTGGTTGGCC GAGATGGCAA GGACGGTGCC
GACGGCCGCG ACGGCCGCGA CGGCGCTCCC GGTGAGCGTG GCGAACAAGG AGAACCAGGT
CGCGATGGCA AGGATGGCAC CGACGGTCGG GATGGCCAGG ACGTCGAACC GGAGGCCCTG
CGGGTCGCTG TCGAGGAGGC GGTCACGCAG GCCGTCAGCC AGATCCCGCT CCCGAAGGAC
GGGACGCCGG GCCGGGACGG TCGAGACGGC GTGGACGGCA AGGACGGCGT CGGCCTCGCC
GATGCTCTGA TCGATCGAGC TGGCAATCTC GTGGTCACGC TGTCGAACGG CGACACCAAG
CAGCTTGGCC TCGTGGTCGG CCGGGACGGC AAGGATGGCC GGGCCGGCAG CGCCGGCAAG
GATGGTGCGC CGGGCGAGCG GGGGGAGCCG GGCCCGCGCG GCGAGCAGGG CGAGAAGGGC
GACCCCGGAG AGCGGGGCGA GCGAGGTCCC GCCGGCGAGC GGGGACCGGC CGGCGAGCCG
GGTCCGCCGG GCGAGCCGGG TCCGCCGGGC GAGCCGGGTC CGCTGGGCGA GCGCGGGCCA
CAAGGTGAGC GCGGCCTGCC GGGCGAGCCC GGCGCTGCGG GTGAGCAGGG TCCGCCCGGT
GAGCGCGGTG AACAGGGACC GCCCGGCGAT CGAGGAGAGC GTGGCGAGCC CGGCGAGCAG
GGGCCTCCAG GCGAGCGGGG CGAACGGGGG CCGCAGGGCG AGCCGGGTCC TCCTGGTGAG
ACGGGCGAGC GCGGTGAGCC GGGTGCTCCT GGTGAGCGCG GCGAGCGCGG GATACCTGGC
GGGCGCGGTG AGAAGGGCGA CCCGGGCCGC GACGGAAAGG ACGGCGCTCC TGGGGCCGCC
GGAGAGCGCG GCGAACGGGG CGAGAAGGGC GAAACCGGTG AGCGCGGCGC CGACGGCTTT
GGCTTCGAGG ACCTGGAGGA GGAGCTCGCC GAGGACGGCC GCACGCTTGT GCGGCGCTAC
CGCCGCGGCG AGGAGGTGAA GGAGTTCCGC CACCGCGTCC CGACGCTGAT CGATCGCGGC
GTCTACAAGG CGGGCACGAT CTACCAGCCC GGCGACGGCG TCACCTGGGC CGGCTCGTTC
TGGATCGCCC AGACGGAGAC CGACGCGAAG CCGGATGGCG GCGAGGGCTG GCGCCTCGCG
GTCAAGCGCG GCCGCGATGG CAAGGACGGC AAGCCGGGCG AGCGCGGGCC CGAGGGCAAG
GCCGGCCCTG ACGGCCGGAG GTGGTGA
 
Protein sequence
MDEADRLAEA VFRAVEGYLA RTVGPLLARL EALERREPLR GERGHNGQPG RGLAKAIVTA 
DGLLVLTMTT GEELSVGRVT GKDGKDGADG APGRDGRDGV DGAPGEPGRD GTDGADGRDG
RDFDPELLRT AVVEEVSKAV DAIPRPRDGA PGRDGTDGKD GRDGKDFDPE VLAAAVEQAV
TKALSQIPVP KDGTPGRDGR DGVGVAGALI DRSGKLILTL SNGETRDLGL VVGRDGKDGA
DGRDGRDGAP GERGEQGEPG RDGKDGTDGR DGQDVEPEAL RVAVEEAVTQ AVSQIPLPKD
GTPGRDGRDG VDGKDGVGLA DALIDRAGNL VVTLSNGDTK QLGLVVGRDG KDGRAGSAGK
DGAPGERGEP GPRGEQGEKG DPGERGERGP AGERGPAGEP GPPGEPGPPG EPGPLGERGP
QGERGLPGEP GAAGEQGPPG ERGEQGPPGD RGERGEPGEQ GPPGERGERG PQGEPGPPGE
TGERGEPGAP GERGERGIPG GRGEKGDPGR DGKDGAPGAA GERGERGEKG ETGERGADGF
GFEDLEEELA EDGRTLVRRY RRGEEVKEFR HRVPTLIDRG VYKAGTIYQP GDGVTWAGSF
WIAQTETDAK PDGGEGWRLA VKRGRDGKDG KPGERGPEGK AGPDGRRW