Gene M446_4341 details

Gene Information

Locus tag: M446_4341
Symbol:
ID: 6133403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4785196
End bp: 4786542
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 71%
IMG OID: 641644480
Product: ABC transporter related
Protein accession: YP_001771118
Protein GI: 170742463
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component
TIGRFAM ID:
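
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions (the record does not state them), and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 4785196
END_BP = 4786542
GENE_LENGTH_BP = 1347
PROTEIN_LENGTH_AA = 448


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1347
    print(protein_length(GENE_LENGTH_BP))  # 448
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.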


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.253377
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCCC ACACCCCGGT TCCCCGCCCG CGCAGCTCCG ACGCGCCGCT CGTCTCCGTC 
GCGGGCGTCC AGCACCATTA CCGCAAGGGC AACGCGTCCG ACCTTCTGGT TCTCGACGAA
GTCAACGTCG ACCTGCGGGC GGGCGAGATC GTCTCCCTGC TCGGCCGCTC GGGATCGGGC
AAGTCGACCC TGCTGCGCAT CATCGCCGGC TTGATGCCGC CGAGCCGGGG ACGGGCGCTG
ATCAACGGCC GGCCGGTGAC CGGTCCGGCC CCCGAAGTCG CCATGGTGTT CCAGTCCTTC
GCGCTGTTTC CCTGGCTGAC CGTGCTGCAG AACGTGGAGG TCGGCCTCGA GGCGCAGGGC
GTGGCGCCGG CCGAGCGGCG CAAGCGCGCG CTCGCGGCGA TCGACCTCAT CGGCCTCGAC
GGGTTCGAGA GCGCCTATCC GAAGGAATTG TCCGGCGGGA TGCGCCAGCG GGTCGGGCTG
GCGCGCGCCC TCGTCGTCCA TCCCGAGATC CTGCTGATGG ACGAGCCCTT CTCGGCCCTC
GACGTGCTCA CGGCCGAGAC CCTGCGCACC GACCTGCTCG ACCTCTGGAT CGAGGCCCGC
ATCCCGACCC GCTCGATCCT GCTCGTGACG CACAACATCG AGGAGGCCGT CCTGATGAGC
GACCGCATCC TGGTCTTCTC GTCGAATCCC GGCCGGGTGG TGGGGGATAT CCGGGTCGAC
CTCCCGCAGC CGCGCAACCG CCTCGATCCG GCCTTCCGGG CGCTGGTGGA CGACATCTAC
GCCCGCATGA CCATGCGCCC GCCCCAGCCG GCCGGGCCGG GCGGGCACGG CAAGCCCGAA
GGGTTCCCGG GCACCGGCAT CGGCATGGTC CTGCCGCGCG TCTCGACCAA CCTGCTGGCG
GGCCTGATCG AGGCGGTGGC CGGCGAGCCC TACCGGGGCA CGGCCGACCT GCCGGCGCTC
GCGAGCACGC TGCAACTCGA AGTCGACGAG CTCTTCCCGA TCGCCGAGAC CCTGCAACTG
CTGCGCTTCG CCGAGCTGGA GGGCGGCGAC ATCACGCTGA CACCGGCGGG GCGCCGCTTC
GCGGAGAGCG CCGTCGACGA GCGCAAGCAG CTCTTCGCCC AGCACCTGAT CGCGTACGTG
CCGCTGGCCG CCCATGTCCG GCGCGTGCTC GACGACCGAG CCTCGCATGC GGCCCCGCGC
CGCCGGTTCC AGGACGAGCT GGAGGATCAC ATGTCGGCCC AGTACGCGCA GCAGACCCTG
CAGGCCGTGA TCTCCTGGGG GCGCTACGCC GAGGCCTTCG CGTATCACGA GGCGAGCGAC
ACGTTCAGCC TGGAGGATCC CGCGTGA
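
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of how the GC content reported in the record could be recomputed from the sequence text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence; paste the full sequence to check
    # the whole-gene value against the record.
    first_line = (
        "ATGGACGCCC ACACCCCGGT TCCCCGCCCG CGCAGCTCCG ACGCGCCGCT CGTCTCCGTC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```

Running the same functions on all lines of the gene sequence gives a whole-gene percentage that can be compared with the GC content field.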
 
Protein sequence
MDAHTPVPRP RSSDAPLVSV AGVQHHYRKG NASDLLVLDE VNVDLRAGEI VSLLGRSGSG 
KSTLLRIIAG LMPPSRGRAL INGRPVTGPA PEVAMVFQSF ALFPWLTVLQ NVEVGLEAQG
VAPAERRKRA LAAIDLIGLD GFESAYPKEL SGGMRQRVGL ARALVVHPEI LLMDEPFSAL
DVLTAETLRT DLLDLWIEAR IPTRSILLVT HNIEEAVLMS DRILVFSSNP GRVVGDIRVD
LPQPRNRLDP AFRALVDDIY ARMTMRPPQP AGPGGHGKPE GFPGTGIGMV LPRVSTNLLA
GLIEAVAGEP YRGTADLPAL ASTLQLEVDE LFPIAETLQL LRFAELEGGD ITLTPAGRRF
AESAVDERKQ LFAQHLIAYV PLAAHVRRVL DDRASHAAPR RRFQDELEDH MSAQYAQQTL
QAVISWGRYA EAFAYHEASD TFSLEDPA
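
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below is illustrative rather than a reference implementation: it uses the standard amino-acid assignments, which translation table 11 shares, and it ignores the special handling of alternative start codons. The function name `translate` and the choice to stop at the first stop codon are assumptions made for this example.

```python
# Codon table built from the NCBI ordering (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments as the
# standard code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("ATGGACGCCC ACACCCCGGT TCCCCGCCCG"))  # MDAHTPVPRP
    # Last 27 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("ACGTTCAGCC TGGAGGATCC CGCGTGA"))     # TFSLEDPA
```

The two fragments reproduce the first ten and last eight residues of the protein sequence listed in the record.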