Gene Mext_4207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4207 
Symbol 
ID5833243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4681821 
End bp4682861 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content67% 
IMG OID641369997 
Producttriple helix repeat-containing collagen 
Protein accessionYP_001641647 
Protein GI163853604 
COG category 
COG ID 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.532408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGC CAATCACTTT CAATAATACG ACGTATGCTC CCGGATTTCT AGGAACCGAC 
GACGGTGGTG CATCTGGAAA CTTTCAGGTC GATACCGCCT CGACCTATTC CGTCACAGTA
TCAGGGACCA TCAACGCTGT CGGTGATCCC GTCACTCTGA CCTATGGAGC CGATGCTCCC
GCCGGTTTTG CAGGCACGTC CGTTCAGTTG ACCTCGACGC AGTTCGACAA TTCAGGCCAG
ATCCTGTTCG TGAGCAGGGC CATCCCGCCC GGTGAGACGG AGACCGGCAA CTACCGCTAC
CTCCTCTCGA ACACCCAGGT GGTCGGCTCG AACCCGCCTC CCGGCTCCAC CCGGACCCGC
TTCCTCGCCG ACGGCAACAA CACGGCCGGC GATTACAACG TCCAGGCCGC GCCCTGCTTC
ACCACGGGCA CGCTCATCCG CACGGCTCGC GGCGAGGTGG CGGTCGAGGA TCTGATTGTC
GGCGATCTCG CCGTGACGGC TTCCGGCACG CTGCGTCCGA TCACCTGGAT CGGCAACCGC
GCCCTCGATG CCAAGGGCGA GGCGCTGCCC CACAACGAGC AGCCCATCCG GATCCGCGCG
GGTGCCTTCG GCCCCGGCCT CCCGGCGCGC GATCTGCGCC TCTCGCACGG CCATCCGGTG
CTCGTCGGCG CCGATGCCAA CGGCGAGGGC GGCGTGCTGG TGCCCGTGAT GTGCCTGATC
AACGGCACCT CCGTCCTCCG CGAGCCGGCG ACGCAGGTGA CCTACTGGCA TATCGAGCTG
GATGCGCACG ACATCCTGCT CGCCGAAGGT CTGGCCGCCG AGAGCTACTA CGACATGGGC
AGCCGCGTTT GGTTCGCCGG CGAGGACGGC ATGCTGACCG ATCCGGACTT CGTGCCGGCC
TGCGAGCACG GCCGCTGCCG CCCTGTGGCG GTGGACGGCG CCCTCGTGGA CGGTGAGCGG
CAGCGGCTCG ACGGCGTCTT CGCCGCGGAG CTCGATGGGC ACAGCGCCTG GGCCGACGCA
CCGGTGTGGC ACGCCGCGTA A
 
Protein sequence
MALPITFNNT TYAPGFLGTD DGGASGNFQV DTASTYSVTV SGTINAVGDP VTLTYGADAP 
AGFAGTSVQL TSTQFDNSGQ ILFVSRAIPP GETETGNYRY LLSNTQVVGS NPPPGSTRTR
FLADGNNTAG DYNVQAAPCF TTGTLIRTAR GEVAVEDLIV GDLAVTASGT LRPITWIGNR
ALDAKGEALP HNEQPIRIRA GAFGPGLPAR DLRLSHGHPV LVGADANGEG GVLVPVMCLI
NGTSVLREPA TQVTYWHIEL DAHDILLAEG LAAESYYDMG SRVWFAGEDG MLTDPDFVPA
CEHGRCRPVA VDGALVDGER QRLDGVFAAE LDGHSAWADA PVWHAA