Gene M446_3612 details

Gene Information

Locus tag: M446_3612
Symbol:
ID: 6133711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4029180
End bp: 4030403
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 72%
IMG OID: 641643779
Product: peptidase T
Protein accession: YP_001770427
Protein GI: 170741772
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01882] peptidase T
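
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state this) and that the gene length includes the stop codon. The function names are illustrative only and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(4029180, 4030403)
print(gene_len)                  # 1224
print(protein_length(gene_len))  # 407
```

Under these assumptions the span reproduces the listed 1224 bp, and 1224 / 3 = 408 codons, one of which is the stop codon, gives the listed 407 aa.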


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0032561
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0670454
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATTC GCGCGCAGCT GGTCGAGCGG TTCTTCCGCT ATGTCGCGGT TCCGAGCCAG 
AGCGACGCCG CCGCGGCGAC GCTGCCGAGC ACGCCGGGCC AGCGCGACCT CGCGGCCCTG
CTGGCCGCGG AGCTGCGCGA GATCGGGCTC GCGCGGGTGA CCCTCGACGA CGCCGCCATC
GTGACGGCGC TGAAGCCCGG CACCCGGCCG GGCGCCCCGC GGATCGGCTT CGTCGCGCAT
CTCGACACGG TCGATGTCGG GCTCTCCCCG GTGATCCGCC CGCAGGTGCT GCGCTTCGAC
GGCCGCGACC TCTGCCTGAA TGCCGAGCGG GACATCTGGC TGCGCGCCGC GGAGCACCCG
GAGATCCTGC CCTGGCGGGG CCAGGACGTG ATCGTCGGCG ACGGCACCAG CGTGCTGGGG
GCCGACAACA AGGCCGCCAT CGCGGTGATG ATGACGCTGC TGGCGACGCT GGGCCCGGCG
GACGCGCACG GCGACATCCT CGTCGCCTTC GTGCCCGACG AGGAGATCGG GCTGCGCGGC
GCCAAGGCCC TCGACCTCGC GCGGTTCGCC TGCGATTTCG CCTACACGAT CGATTGCTGC
GAGGTGGGCG AGGTCGTGCT GGAGACCTTC AACGCCGCCA ACGGCGAGGT CGTGTTCACC
GGGGTCAGCG CCCACCCGAT GGCGGCCAAG GGCGTGATGG TCAACCCGCT GCTGATGTCG
CAGGACTTCA TCGCGCAGTT CGACCGGGCA GAGACGCCCG AGCGCACCGC GGGCCGCGAG
GGCTATTACT GGTTCAGCGG GATGGTGGCG AACGACAGCG AGGCGCGGCT CCAGGTCCGG
ATCCGGGACT TCGACCGGGA CGCCTTCGCG GCCCGCAAGG CGCGGGTGGA GCGGGAGGCC
GCCCGCGTCG CCGCCCGCTA CCCGACCGGC CGCGTGGCCT GCCGCCTGGA CGACGTGTAC
GGCAACATCC GGGACTCGCT CGGGGACGAC CGGCGGGCGG TGGACCTGCT GTTCTCCGCG
CTGGCGGCGC TGGAGATCAC CCCGAAGCTG ATCCCGATGC GCGGCGGCAC CGACGGGTCG
GCCCTCTCGG CGCGGGGCGT GCCGACGCCG AACTTCTTCA CCGGCGCCTG CAATTTCCAC
TCGCGGTTCG AGTTCCTGCC GGTTCCCGCC TTCGAGGCCT CCTTCAAGGT CGCGCGCACG
ATCTGCCGGC TCGCGGCCGC CTGA
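
The gene sequence is displayed in blocks of 10 bases, so it has to be stripped of whitespace before any calculation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first display row is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display in 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First display row of the gene sequence only; paste the full sequence to
# compare the result against the GC content reported in the record.
first_row = "ATGAGCATTC GCGCGCAGCT GGTCGAGCGG TTCTTCCGCT ATGTCGCGGT TCCGAGCCAG"
print(len(clean_sequence(first_row)))
print(round(gc_percent(first_row), 1))
```

A single row is not expected to match the whole-gene figure; the value for the full 1224 bp sequence is the one to compare with the record.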
 
Protein sequence
MSIRAQLVER FFRYVAVPSQ SDAAAATLPS TPGQRDLAAL LAAELREIGL ARVTLDDAAI 
VTALKPGTRP GAPRIGFVAH LDTVDVGLSP VIRPQVLRFD GRDLCLNAER DIWLRAAEHP
EILPWRGQDV IVGDGTSVLG ADNKAAIAVM MTLLATLGPA DAHGDILVAF VPDEEIGLRG
AKALDLARFA CDFAYTIDCC EVGEVVLETF NAANGEVVFT GVSAHPMAAK GVMVNPLLMS
QDFIAQFDRA ETPERTAGRE GYYWFSGMVA NDSEARLQVR IRDFDRDAFA ARKARVEREA
ARVAARYPTG RVACRLDDVY GNIRDSLGDD RRAVDLLFSA LAALEITPKL IPMRGGTDGS
ALSARGVPTP NFFTGACNFH SRFEFLPVPA FEASFKVART ICRLAAA
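
The protein sequence can be cross-checked against the gene sequence by translating it. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may serve as start codons, so a standard codon table is sufficient for a gene that starts with ATG. The following is a minimal sketch rather than a replacement for an established library; `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in the record.
print(translate("ATGAGCATTC GCGCGCAGCT GGTCGAGCGG"))  # MSIRAQLVER
```

The output matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence should, under the same assumptions, yield the full protein.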